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5 

The invention encompasses methods and reagents for the 
diagnosis of a disease caused by or associated with a gene 
having a somatic mutation giving rise to a frameshift 
mutation. The methods include the steps of providing a body 
10 fluid or tissue sample from a patient; and analyzing the 
sample for the presence of a gene having a frameshift 
mutation or a protein encoded thereby, wherein the presence 
of the mutated gene or encoded protein is indicative of the 
disease. 

15 

BACKGROUND OF THE INVENTION 

Many diseases are believed to stem from somatic 
genetic mutation rather than inherited gene abnormalities, 
20 including different types of cancers and neurodegenerative 
diseases. 

Diseases which are caused by somatic mutation are 
often age-related, with an increasing incidence of somatic 

25 mutation with increasing age. For example, the prevalence 
of cancer increases with age, and it appears that some 
cancers are caused by somatic mutation. Cancer is the 
second leading cause of death in the United States, 
accounting for approximately 500 , 000 deaths (or 20 percent 

30 of all deaths) per year. These diseases are so prevalent 
that unless current trends are reversed, one in three living 
Americans will develop cancer at some time. Cancers are 
usually detected by clinical methods and cytological methods 
and are difficult to detect early. In some cases, familial 

35 cancers have been traced to inherited genes, and detection 
of such genes is believed to be predictive of susceptibility 
to the cancer. 
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It is an object of the present invention to provide 
methods and assays for detection and/ or treatment of 
diseases involving genetic mutations, particularly those 
diseases relating to aging, wherein the probability of 
5 having the disease increases with the age of the patient. 
The invention contemplates detection and/ or treatment of 
those age related diseases which are due to mutations 
occurring in the DNA of somatic cells. If the mutations are 
not corrected, the disease may result. 

10 

Another object of the invention is to treat diseases 
identified according to the invention, by providing to a 
patient afflicted with the disease or having a propensity to 
develop the disease, a corrective agent such as an enzyme or 
15 oligonucleotide . 

Yet another object of the invention is to provide a 
method for identifying age-related diseases by correlating 
nucleotide sequence mutation hotspots with the disease. 

20 

Other objects of the invention relate to 
identification, detection and treatment of age related 
diseases including cancers (especially non-hereditary 
cancers) and neurodegenerative diseases, such as Alzhemier* s 

25 Disease (AD), Parkinson's Disease (PD) , Down's syndrome, 
frontal lobe dementia (Pick's Disease), progressive 
supranuclear palsy (PSP) , amyotrophic lateral sclerosis, 
Huntington's Disease, multiple sclerosis, and other 
degenerative diseases. Early disease diagnosis is important 

30 for effective treatment. 

Alzheimer's Disease is in most cases a disease which 
is related to aging. AD is characterised by atrophy of 
nerve cells in the cerebral cortex, subcortical areas, and 
35 hippocampus and the presence of plaques, dystrophic neurites 
and neurofibrillary tangles. In most cases, it is not known 
whether AD is caused by a genetically inherited trait or by 
environmental factors, e.g., somatic mutations, or both. 
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The pathogenic mutation is unknown. 

Another object of the invention is to provide a 
diagnostic test for AD which enables definitive diagnosis of 
5 AD in living patients. Furthermore, as AD is a progressive 
disease, it is desirable to diagnose AD as early as possible 
so that preventative action may be taken. 

A number of diagnostic methods have been previously 
10 suggested for AD diagnosis, most of which have focused on 
the amyloid protein* See for example U.S. Patents 
4,666,829, 4,816,416 and 4,933,159. However, amyloid 
deposits have been found in individuals, especially aged 
persons, who have not shown signs of dementia (See J. Biol. 
15 Chem*, 265 . pp 15977, 1990; and Tables 2 and 3). Diagnostic 
tests based on the amyloid protein have therefore been shown 
to lack specificity for AD. 

In U.S. Patent 4,727,041 a diagnostic test for AD is 
20 described based on . measuring levels of somatotropin and 
somatomedin-C in blood sera following administration of an 
L-dopa proactive test. 

In International patent application WO 94/02851, a 
25 method is described for identifying AD by the use of 
antibodies having affinity for paired helical filaments in 
order to determine the levels of paired helical filaments in 
cerebral spinal fluid. The presence of paired helical 
filaments is alleged to be indicative of AD. 

30 

Other diagnostic methods are based on the 
identification of "disease specific marker proteins" in the 
cerebral spinal fluid. In International patent application 
WO 95/05604, for example, five disease specific proteins are 
35 identified by their molecular weights. However, the 
specific identity of the proteins is unknown and their 
specific relationship to the pathogenesis of AD is also 
unknown. The five "disease specific marker proteins" may 
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therefore be pres nt as a result of a more fundamental 
cellular or biochemical change. 

Another object of the invention is to provide for 
5 detection of AD preferably early on in the disease state* 
It is desirable to detect a protein or substance which is 
either directly responsible for the disease or is involved 
early on in the pathogenesis of the disease, or if not 
involved is nevertheless generated directly or indirectly by 
10 the mechanism causing the disease. Such a protein or 
substance may be the "causative" agent to the disease or may 
be "associated with" the disease state in the sense of being 
diagnostic of the disease state. 

15 Recently, Sherrington et al. in Nature, 175, pp 254- 

260 , 1995, identified a gene on chromosome 14 bearing 
missense mutations which are associated with up to 70% of 
familial early onset AD cases. A missense mutation involves 
a nucleotide substitution, usually a single nucleotide 

20 substitution, which results in an amino acid substitution at 
the corresponding codon. The missense mutations disclosed 
in Sherrington et al. are predicted to change the encoded 
amino acid at the following positions (numbering from the 
first putative initiation codon) Met to Leu at codon 146, 

25 His to Arg at codon 163, Ala to Glu at codon 246, Leu to Val 
at codon 286, Cys to Tyr at codon 410. It has been proposed 
that these mutations may be useful in identifying early 
onset AD. As stated earlier, the majority of AD cases are 
late onset (after 65 years of age; Table 1) and it is 

30 therefore still a problem to identify the majority of 
individuals having AD, particularly late onset AD. 

Presently, there are a number of substances which are 
alleged to be useful in the treatment of AD. However, so 
35 far only limited success has been achieved with these 
substances. Methods for effectively treating and/or 
preventing AD are still required (see Allen and Burns, 
Journal of Psychopharmacology , 9, pp 43-56, 1995). 
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SUMMARY OF THE INVENTION 



The present invention is based on the observation that 
5 a gene containing a frameshift mutation and encoding a 
corresponding mutant protein may be correlated with the 
presence of a disease. 

According to the present invention there is provided 
10 a method for the diagnosis of a disease caused by or 
associated with at least one gene having one or more somatic 
mutations giving rise to a frameshift mutation comprising: 
i. providing a biological sample, such as a body fluid or 
tissue sample, from a patient; and ii. analyzing .the sample 
15 for the presence of a gene having a frameshift mutation or 
a mutant protein encoded thereby, wherein the presence of 
the mutated gene or mutant protein is indicative of the 
disease. 

20 The term "somatic mutation" refers to a mutation 

occurring in a somatic or non-germline cell, and does not 
pass through the germ line and is therefore not inherited. 

A "mutant" protein is a polypeptide encoded by a mRNA 
25 at least a part of which is in a reading frame that is 
shifted relative to the initiation start codon from that of 
the native or wild-type reading frame, and thus will include 
any protein having an aberrant carboxy terminal portion 
which is encoded by the +1 or +2 reading frame of the wild 
30 type gene sequence. Thus, the mutant protein will include 
a hybrid wild-type/ nonsense protein having an amino terminal 
amino acid sequence that is encoded by the wild type (0) 
reading frame and a carboxy terminal amino acid sequence 
that is encoded by the +1 or +2 reading frame, and thus the 
35 nonsense portion of the mutant protein- The cross-over 
point b tween the wild type and nonsense amino acid 
seguenc s is the codon containing the frameshift mutation. 
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The invention is based on the discovery of the 
pres nee of such a mutant protein or an accumulation of more 
than one mutant protein in a tissue from a diseased 
individual, and also on identification of the mutant protein 
5 as indicative of the disease. 

The phrase "caused by or associated with" refers to 
a gene which is either fully or partly responsible for the 
disease , or a gene which is not responsible for the disease 
10 but is associated with the diseased state in the sense that 
it is diagnostic of the diseased state. 

A disease caused by or associated with at least one 
gene having one or more somatic mutations giving -rise to a 

15 frameshift mutation can be any disease including non- 
hereditary cancers, neurodegenerative diseases such as 
Parkinson's Disease (PD) , Alzheimer's Disease (AD) , Down's 
syndrome, frontal lobe dementia (Pick's Disease), 
progressive supranuclear palsy (PSP) , amyotrophic lateral 

20 sclerosis, Huntington's Disease, multiple sclerosis, and 
other degenerative diseases such as cardiovascular diseases 
and rheumatoid arthritis. Cancers treatable according to 
the invention include but are not limited to Hodgkin's 
disease, acute and chronic lymphocytic leukemias, multiple 

25 myeloma, breast, ovary, lung, and stomach or bladder 
cancers . 

A gene having a somatic mutation which leads to a 
frameshift mutation, and herein referred to as the "mutant 
30 gene", can be any gene having at least one mutation which 
leads to a frameshift mutation. 

A "frameshift mutation" refers to a deletion or 
35 insertion of one or more nucleotides within an open reading 
frame, for example, a single nucleotide or dinucl otide 
deletion or insertion, such that the reading frame of the 
coding region is shifted by one or two nucleotides. 
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Preferably, the frameshift mutation is a nucl otide or 
dinucleotide deletion leading to a + 1 or +2 frameshift 
mutation* However, any number of nucleotide deletions can 
occur provided a frameshift mutation results. 
5 Alternatively, the insertion of one or more nucleotides may 
give rise to a frameshift and such mutations also form part 
of the present invention. 

Other genetic modifications which give rise to a 
10 frameshift also form part of the present invention, such as 
a change in the nucleotide sequence which leads to 
translation initiation from a different position or a 
mutation outside a coding region, such as within an Intron 
or a 5* or 3' untranslated region, which mutation may result 
15 in mis-translation and production of a mutant protein. In 
this type of gene mutation, the mutant protein would be 
completely nonsense sequences and would contain no wild- 
type sequences. 

20 It is preferred that the mutation is a nucleotide and 

more preferably a dinucleotide deletion or insertion 
associated with the nucleotide sequence GAGA of the gene; 
especially preferred frameshift mutations are associated 
with the nucleotide sequence GAG AX, where X is one of G, A, 

25 T or C; thus preferred motifs include GAG AG, GAG AC, GAGAA, 
and GAG AT. Preferably the dinucleotide deletion is an AG 
deletion. It is further preferred that the mutant gene has 
one or two dinucleotide deletions associated with a GAGA, 
GAGAG, GAGAC, GAGAA, or GAGAT leading to a + 1 or + 2 

30 frameshift mutation respectively. 

In a preferred embodiment of the invention, the 
somatic mutations occur in genes of the neuronal system, 
where the disease is a neurodegenerative disease. 

35 

The "neuronal system" is defined as any cells, genes, 
proteins or substances relating to or forming part of the 
neuronal system such as nerve cells, glial cells, proteins 
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including Tau, B amyl id precursor prot in, ubiquitin, 
apolipoprotein E4 neurofilament proteins and microtubule 
associated protein II, and the genes encoding the proteins. 

5 Where the disease is a neurodegenerative disease, 

especially AD, the preferred mutant genes of the present 
invention are those encoding the 6 amyloid precursor 
protein, the Tau protein, ubiquitin, apolipoprotein-E 4 (Apo- 
E 4 ) , microtubule associated protein II (MAP 2) and the 
10 neurofilament proteins, having a deletion, insertion or 
other modification leading to a frameshift mutation* The 
most preferred mutant genes of the present invention are 
those encoding ubiguitin, MAP 2 and the neurofilament 
proteins, having a frameshift mutation. 

15 

It is preferred that the mutation is an AG dinucleotide 
deletion associated with a GAGA, or GAGAX leading to a 
frameshift mutation. It is further preferred that the 
mutant gene has one or two AG deletions each associated with 
20 a GAGA or similar motif, leading to a + 1 or + 2 frameshift 
mutation respectively. 

The term "mutant protein" as used herein is defined 
as the protein encoded by the mutant gene of the present 
25 invention. 

It is preferred that the methods of the present 
invention are for the diagnosis of a disease caused by or 
associated with at least one gene having one or more somatic 

30 mutations giving rise to a frameshift mutation. A preferred 
disease for diagnosis by the present invention is AD, except 
the early onset AD cases found to be linked to chromosome 14 
and 1. It is further preferred that the methods of the 
present invention are for the diagnosis of late onset AD, 

3 5 especially non-familial or "sporadic" late onset AD cases. 

As used her in, "biological sample" refers to a body 
fluid or body tissue which contains proteins and/or cells 
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from which nucleic acids and proteins can be isolated. 
Preferred sources include buccal swabs, bio d, sperm, 
epithelial or other tissue, milk, urine, cerebrospinal 
fluid, sputum, fecal matter, lung aspirates, throat swabs, 
5 genital swabs and exudates, rectal swabs, and nasopharyngeal 
aspirates. 

The body fluid sample can be any body fluid which 
contains cells having the somatic mutation which gives rise 

10 to the frameshift mutation and causes the diseases. When 
the disease is a neurodegenerative disease it is preferred 
that the body fluid sample contains cells of the neuronal 
system or the products of such cells. When the disease is 
a neurodegenerative disease, the preferred body fluid is 

15 cerebral spinal fluid which can be obtained after a lumbar 
puncture (Lannfelt et al., Nature Medicine, i, pp 829-832, 
1995). Another preferred body fluid is blood, as it is 
easily obtained and contains lymphocytes which can be 
analyzed for the presence of the mutant gene or encoded 

20 protein. 

The tissue sample can be any tissue and is preferably 
one that can be easily obtained, such as skin and nose 
epithelium. 

25 

Preferably, when analyzing the sample for a mutant 
gene, a nucleic acid probe is used. The nucleic acid probe 
is preferably a nucleotide probe having a sequence 
complementary to part of the mutant gene encompassing the 
30 mutation giving rise to the frameshift mutation. 

As would be apparent to one skilled in the art, the 
probe may be used to detect DNA or RNA in a fluid sample or 
in a tissue sample. 

35 

The present invention further provides a nucleic acid 
probe having a sequence complementary to part of the mutant 
gene encompassing the mutation leading to the frameshift 
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mutation, Th probe is preferably sufficiently 

complementary to the mutant sequence of the gene so that 
under stringent conditions the probe only remains bound to 
the mutant sequence. -Stringent" conditions are defined 
5 herein as DNA : DNA hybridization conditions which may be 
performed at 65 °C using a hybridization buffer equivalent to 
50% formamide and 0.1X SSC (see below and Evans et al. PNAS 
(1994) 9; 6059-6063, 6060). -Stringent" conditions also 
preferably include stringent washes, as described in Evans 
10 et al. (Ibid) . 

The probe may be of any length but is preferably 
between 5 and 50 nucleotides long, more preferably between 
10 and 30 nucleotides long. For example, the probe may be 
15 5, 10, 15, 20, 25, or 30 nucleotides in length. 

In a preferred embodiment the probe comprises a 
sequence complementary to a GAGA or GAG AX, having a 
nucleotide or dinucleotide deletion or insertion, and 

20 nucleotide sequences corresponding to the nucleotide 
sequences flanking the GAGA or motif in the wild-type gene. 
It would be apparent to one skilled in the art that if RNA 
was being probed for, a probe comprising a sequence 
complementary to the corresponding GAGA motif present in the 

25 RNA would have to be used. 

Methods of detecting the presence of the mutant gene 
include the polymerase chain reaction (PCR) using primers 
having a sequence complementary to the sequence either side 
30 of the mutation which gives rise to the frameshift mutation 
are used to amplify the DNA, as described hereinbelow. 

The primers used in the above PCR based method can 
vary in size from 20bp to 2-3 kb; for example, 20bp, 50bp, 
35 lOObp, 500bp, lOOObp, 1500bp, 2000bp, or 3000bp. The 
primers can be prepared by a number of standard techniques 
including cloning the sequences flanking the nucleotide 
region to be amplified or by synthesizing the primers using 
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phosphoramidite in thod. 

The present invention further provides primers for use 
in the above defined PCR based methods for the amplification 
5 of the nucleotide region containing the mutation. 

Preferably, when analyzing the sample for the mutant 
protein of the present invention an immunological test is 
employed. The immunological test is preferably based on the 
10 use an antibody molecule having specificity for the mutant 
protein of the present invention and not the wild-type 
protein. 

The present invention thus further provides an 
15 antibody molecule having specificity for the mutated protein 
of the present invention but not for the wild- type protein. 
Prefereably, the antibody is specific for the carboxy 
terminal end of the mutant protein. 

20 The present invention further provides a method for 

the diagnosis of a neurodegenerative disease comprising: i. 
providing a body fluid or tissue sample from a patient; and 
ii. analyzing the sample for the presence of a gene of the 
neuronal system having a frameshift mutation or a protein 

25 encoded thereby , wherein the presence of the mutated gene is 
indicative of a neurodegenerative disease. 

Preferably, the neurodegenerative disease is AD. 

30 The present invention also relates to methods for 

preventing and/or treating the diseases, vectors for 

preventing and/or treating the diseases and for the 

production of diagnostic reagents, compositions for 
preventing and/or treating the diseases, nucleic acid 

3 5 sequences, probes and antibody molecules for use in the 
present invention and transgenic animals. 
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The present inventi n further provid s a diagnostic 
kit for diagnosing a disease caused by r associated with at 
least one gene having one or more somatic mutations giving 
rise to a frameshift mutation comprising: i. a nucleic acid 
5 probe having a sequence complementary to part of the mutant 
gene which encompasses the mutation which leads to the 
frameshift mutation and packaging materials therefor; and 
ii. means for detecting the probe bound to the mutant gene. 

10 The present invention further provides a diagnostic 

kit for diagnosing a disease caused by or associated with at 
least one gene having one or more somatic mutations giving 
rise to a frameshift mutation comprising: i. primers for use 
in a PCR reaction, the primers having a • sequence 

15 complementary to the sequence either side of the mutation 
which gives rise to the frameshift mutation, packaging 
materials therefor, and reagents necessary for performing 
a PCR reaction and amplifying the DNA or RNA sequence 
containing the mutation; and ii. means for detecting the 

20 amplified DNA or RNA sequence containing the mutation* 

The present invention further provides a diagnostic 
kit for diagnosing a disease caused by or associated with at 
least one gene having one or more somatic mutations giving 
25 rise to a frameshift mutation comprising: i. an antibody 
molecule having specificity for the mutant protein of the 
present invention and not the wild-type protein; and ii. 
means for detecting the antibody molecule bound to the 
mutant protein. 

30 

The antibody molecule and the means for detecting the 
bound antibody molecule are as defined above. 

In a further embodiment of the present invention the 
35 diagnostic kit described above additionally comprising: i. 
an antibody molecule having specificity for the wild-type 
protein; and ii. means for detecting the antibody molecule 
bound to the wild-type protein, as a control for diagnosing 
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a disease caused by or associated with at least one g ne 
having one or more somatic mutations giving rise to a 
frameshift mutation. 

5 The present invention further provides a gene having 

one or more somatic mutations giving rise to a frameshift 
mutation which causes or is associated with a disease. 

The present invention further provides a mutated 
10 protein encoded by the mutated gene found to be indicative 
of a disease , the mutant gene having one or more somatic 
mutations giving rise to a frameshift mutation- Preferably, 
the mutant protein contains an antigenic epitope specific 
for the diseased state, examples of which are provided in 
15 Table 7. 

In a preferred embodiment of the present invention the 
mutated gene encodes a protein comprising at least part of 
the sequence designated +1 or +2 in any one of Figures 2 to 
20 9, or an immunologically equivalent fragment thereof. 

In a preferred embodiment the mutated protein 
comprises any one of the following individual sequences: 
RGRTSSKELA ; HGRLAPARHAS ; YADLREDPDRQ ; RQDHHPGSGAQ ; 
25 GAPRLPPAQAA; KTRFQRKGPS ; PGNRSMGHE; EAEGGSRS; or VGAARDSRAA, 
(Seq. ID Nos: 1-9, respectively) especially when the disease 
is a neurodegenerative disease such as AD. 

In a preferred embodiment, the antibody molecule of 
30 the present invention has affinity for the mutant proteins 
defined above. 

The present invention also relates to a method for 
treating and/or preventing a disease caused by or associated 
35 with at least one gene having one or more somatic mutations 
giving rise to a frameshift mutation. The finding of 
mutations in genes which lead to the production of mutant 
proteins, and which are indicative of a disease, has led to 
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a number of ways of treating and/ r pr v nting the diseas . 

The present invention further provides a method for 
detecting errors in nucleic acid repair mechanisms 
5 comprising: i. providing a body fluid or tissue sample from 
a patient; and ii. analyzing the sample for the presence of 
a gene having a frameshift mutation or a protein encoded 
thereby, the presence of a mutation being indicative of an 
error in the nucleic acid repair mechanisms. 

10 

The correction of the mutations found in the mutant 
genes of the present invention is therefore a valuable 
method for combatting diseases. 

15 The present invention further provides methods of 

treating a disease by administering to a patient an 
expression vector encoding one or more DNA repair enzymes. 

The present invention further provides the use of an 
20 expression vector encoding one or more repair enzymes in the 
manufacture of a composition for the treatment of a disease 
caused by or associated with at least one gene having one or 
more somatic mutations giving rise to a frameshift mutation. 

25 The present invention further provides a method of 

treatment and/ or prevention of a disease caused by or 
associated with at least one gene having one or more somatic 
mutations giving rise to a frameshift mutation, comprising 
administering an expression vector encoding one or more 

30 repair enzymes to a patient suffering from or likely to 
suffer from the disease. 

The present invention further provides an expression 
vector encoding one or more repair enzymes. 

35 

The repair enzymes ncoded by the vector can be any 
repair enzyme and/or an associated protein (see Kunkel, 
Current Biology, 1995, Vol. 5, No. 10, p. 1091-1094) ; such 
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as, for example, a G/T mismatch binding protein (GTBP or 
pl60) , which are capable of or contribute to the repair of 
a somatic mutation which leads to a frameshift mutation, 
such as a dinucleotide deletion. Preferred repair enzymes 
5 include Mut H, Mut S, Mut L, and Mut U, and the homologs 
thereof, including mammalian homologs. Such homologs 
include MSH 1-6, PMS 1-2, MLH 1 and GTBP. 

The invention also encompasses methods of combatting 
10 diseases caused by at least one gene having one or more 
somatic mutations giving rise to a frameshift mutation by 
targeting the mRNA transcript transcribed from the mutated 
gene, and correcting the mutant mRNA using ribozymes. 

15 Accordingly, the present invention further provides 

methods of treating the disease by administering an 
expression vector encoding a ribozyme in therapy. 

The present invention further provides the use of an 
20 expression vector encoding a ribozyme in the manufacture of 
a composition for the treatment of a disease caused by at 
least one gene having one or more somatic mutations giving 
rise to a frameshift mutation. 

25 The present invention further provides a method of 

treatment and/ or prevention of a disease caused by at least 
one gene having one or more somatic mutations giving rise to 
a frameshift mutation, comprising administering an 
expression vector encoding a ribozyme under the control of 

30 a promoter to a patient suffering from or likely to suffer 
from the disease. 

The present invention further provides an expression 
vector encoding a ribozyme under the control of a promoter. 

35 

The ribozyme encoded by the vector is preferably 
specific for RNA containing the mutations described above or 
for the RNA encoding a defective r pair enzyme. For 

SUBSTITUTE SHEET (RULE 26) 



WO 97/12992 



16 



PCT/IB96/01106 



example, if a d f ct in a repair enzyme has lead to the 
mutated gene, by correcting the transcribed messag from the 
repair enzyme gene the function of the repair enzyme can be 
restored • 

5 

A further method for combatting a disease caused by 
or associated with at least one gene having one or more 
somatic mutations giving rise to a frameshift mutation, is 
by gene therapy. 

10 

A vector encoding a non-mutated version of the mutated 
gene under the control of a promoter and other selected 
transcriptional or translational control elements can be 
delivered to affected or susceptible cells leading to the 
15 production of the correct protein in the cell. By 
increasing the percentage of the correct protein produced in 
relation to the mutated protein the further progression of 
the disease will be reduced, prevented and possibly 
reversed* 

20 

The present invention further provides methods of 
treating a disease by administering an expression vector 
encoding the wild-type version of the mutated gene. 



25 The present invention further provides the use of an 

expression vector encoding the wild- type version of the 
mutated gene in the manufacture of a composition for 
treating a disease caused by or associated with at least one 
gene having one or more somatic mutations giving rise to a 

30 frameshift mutation. 



The present invention further provides a method of 
treatment and/or prevention of a disease caused by or 
associated with at least one gene having one or more somatic 
35 mutations giving rise to a frameshift mutation, comprising 
administering an expression vector encoding the wild-type 
version of the mutated gene to a patient suffering from or 
likely to suffer from the disease. 
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The present invention further provides an expr ssion 
vector encoding the wild- type version of the mutated gene. 

5 The present invention also provides a vector encoding 

a wild-type version of the defective repair enzyme which has 
lead to or contributed to the presence of the mutated gene 
which causes or is associated with a disease, under the 
control of a promoter. 

10 

Preferably the wild-type repair enzyme is MSH 1-6, PMS 
1-2 or MLH 1, or an associated protein such as GTBP. 

The present invention further provides the use of more 
15 than one of the vectors of the present invention in any 
combination in therapy. 

The present invention further provides the use of more 
than one of the vectors of the present invention in any 
20 combination in the treatment and/or prevention of a disease 
caused by or associated with at least one gene having one or 
more somatic mutations giving rise to a frameshift mutation. 

The pharmaceutical compositions of the present 
25 invention in addition to the vectors as defined herein may 
additionally comprise a pharmaceutics lly acceptable 
excipient. 

The invention also encompasses host cell lines and 
30 transgenic animals wherein the somatic cells of the animal 
and the DNA of the host cell contains a transgene which is 
a mutant gene having one or more somatic mutations giving 
rise to a frameshift mutation. 

35 Preferably, the transgenic animals of the present 

invention are additionally defective in nucleic acid repair 
mechanisms. Thus, the mutated g ne specifically expressed 
in this type of transgenic animal will not be repaired. 
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The present invention further provides an animal 
having somatic cells containing at least one of the mutated 
genes of the present invention. 

5 

Preferably, the animal is a rodent, more preferably 
a rat or mouse. 

As used herein, and * expression vector" or an 
10 -expressible gene" denotes a vector containing a gene, or a 
gene which is expressible in a selected host cell. The gene 
will therefore be operatively associated with 
transcriptional and translational control sequences 
sufficient to permit expression of the gene in the host 
15 cell. Such control sequences include but are not limited to 
a promoter, and enhancer, a locus control region, a ribosome 
binding site, a polyadenylation site and a transcription 
termination site* 

20 The present invention further provides a method for 

identifying diseases caused by or associated with at least 
one gene having one or more somatic mutations giving rise to 
a frameshift mutation. The method comprises: i. providing 
the sequence of a gene suspected of being involved in the 

25 pathogenesis of a disease; ii. identifying the sequence of 
the mutant protein encoded by the gene sequence following a 
frameshift mutation; iii. preparing a probe to the mutant 
protein or a fragment thereof; and iv. probing a body fluid 
or tissue sample from a patient having the disease and a 

30 patient not having the disease, in order to find a 
correlation between the presence of the mutant protein and 
the diseased state. 

Preferably, the probe is an antibody molecule as 
3 5 defined herein. It is further preferred that the antibody 
molecule has affinity for a protein comprising at least one 
of the sequences: RGRTSSKELA; HGRLAPARHAS; YADLREDPDRQ; 
RQDHHPGSGAQ; GAPRLPPAQAA; KTRFQRKGPS; PGNRSMGHE; EAEGGSRS; 
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or VGAARDSRAA, esp cially when the disease is a 
neurodegenerative disease such as AD. 

Other features and advantages of the invention will 
5 be apparent from the following description of the preferred 
embodiments thereof , and from the claims. 

DESCRIPTION OF DRAWINGS 

10 

The invention is now illustrated in the appended 
example with reference to the following drawings: 

Figure l is a copy of a paraffin section of the 

15 frontal cortex of a female Alzheimer patient (70 years old, 
#83170, Table 2) immunocytochemically incubated with an 
antibody against a peptide predicted by the + 1 reading 
frame of BAPP (Tables 5 and 6) . Dystrophic neurites 
(arrowheads) and tangles (arrows) are clearly visible in the 

20 cortical layer III. 

Figure 2 presents the coding nucleotide sequence of 
the human amyloid A4 gene (SEQ ID No. 10) , the amino acid 
sequence of the wild-type protein (SEQ ID No. 11), the 
mutant + 1 frameshift protein (SEQ ID No. 12) and the 

25 mutant + 2 frameshift protein (SEQ ID No. 13). 

Figure 3 presents the coding nucleotide sequence of 
the human microtubule-associated protein tau gene (SEQ ID 
No. 14), the amino acid sequence of the wild-type protein 
(SEQ ID No. 15), the mutant + 1 frameshift protein (SEQ ID 

30 No. 16) and the mutant + 2 frameshift protein (SEQ ID No. 
17). 

Figure 4 presents the coding nucleotide sequence of 
the human ubiquitin gene (SEQ ID No. 18) , the amino acid 
sequence of the wild-type protein (SEQ ID No, 19), the 
35 mutant + 1 frameshift protein (SEQ ID No. 20) and the 
mutant + 2 frameshift prot in (SEQ ID No. 21). 

Figure 5 presents the coding nucl otide sequence of 
the human apolipoprotein E gene (SEQ ID No. 22), the amino 
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acid sequence of the wild-type protein (SEQ ID N . 23) , the 
mutant + 1 frameshift protein (SEQ ID No* 24) and the 
mutant + 2 frameshift protein (SEQ ID No. 25)- Information 
concerning restriction enzyme sites is also given. 
5 Figure 6 presents the coding nucleotide sequence of 

the human microtubule-associated protein 2 (SEQ ID No. 26) , 
the amino acid sequence of the wild-type protein (SEQ ID No. 
27), the mutant + 1 frameshift protein (SEQ ID No. 28) and 
the mutant + 2 frameshift protein (SEQ ID No. 29) . 

10 Figure 7 presents the coding nucleotide sequence of 

the human neurofilament subunit NF-low (SEQ ID No. 30) , the 
amino acid sequence of the wild- type protein (SEQ ID No. 
31), the mutant + 1 frameshift protein (SEQ ID No. 32) and 
the mutant + 2 frameshift protein (SEQ ID No. 33) . 

15 Figure 8 presents the coding nucleotide sequence of 

the human neurofilament subunit NF-M (SEQ ID No. 34), the 
amino acid sequence of the wild-type protein (SEQ ID No. 
35), the mutant + 1 frameshift protein (SEQ ID No. 36) and 
the mutant + 2 frameshift protein (SEQ ID No. 37) . 

20 Figure 9 presents the coding nucleotide sequence of 

the human neurofilament subunit NF-H (SEQ ID No. 38), the 
amino acid sequence of the wild-type protein (SEQ ID No. 
39) , the mutant + 1 frameshift protein (SEQ ID No. 40) and 
the mutant + 2 frameshift protein (SEQ ID No. 41). 

25 Figure 10 presents the partial mRNA nucleotide 

sequence and amino acid sequence of three human neuronal 
proteins (B amyloid precursor protein (exons 9 and 10) , Tau 
(exon 13) and Ubiquitin B (exon 2)) expressed in the 
wildtype and +1 reading frame. 

30 

DESCRIPTION 

The invention is illustrated by the following 
35 nonlimiting examples wherein the following materials and 
methods are employed. The entire disclosure of each of the 
literature r ferences cited hereinafter are incorporated by 
reference herein. 
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The present invention is based on the discovery that 
frameshift mutations occur in a single gene or number of 
genes whose product or products are mutant proteins that are 
5 associated with, and indicative of, a disease state. The 
invention is based on the recognition that the presence of 
a frameshift mutation results in a new coding sequence for 
the cell containing the frameshift mutation, and thus a new 
polypeptide (herein termed a mutant protein) which may be 
10 correlated with and thus be indicative of a disease. 

According to the present invention, diagnosis and/or 
identification of a disease caused by or associated with at 
least one gene having one or more somatic mutations which 
15 give rise to a frameshift mutation is accomplished as 
described herein* 

According to the present invention, methods for 
preventing and/ or treating the diseases, vectors for 

20 preventing and/ or treating the diseases and for the 
production of diagnostic reagents, compositions for 
preventing and/or treating the diseases, nucleic acid 
sequences, probes and antibody molecules for use in the 
present invention and transgenic animals are accomplished as 

25 described herein. 

Nucleotide deletions and other somatic mutations occur 
in genes due to a variety of reasons including errors in 
replication, as a result of recombination events, due to the 

30 presence of the mutagenic compounds, or as a result of a 
highly transcriptionally active state. Somatic mutations 
therefore regularly occur. In order to protect against such 
somatic mutations, mechanisms for correcting such mutations 
exist. The correction mechanisms involve a number of repair 

3 5 enzymes which detect and bind to the site of a mutation, and 
correct the mutation. 
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If the repair enzymes are defective or absent a number 
of problems can occur. For example, in x roderma 
pigmentosum a deficiency in the excision-repair of somatic 
mutations leads to skin disorders in the patient. More 
5 recently, it has been shown that defective repair enzymes 
can lead to predisposition for certain sporadic cancers. 
For example, hereditary non-polyposis colorectal carcinoma 
(HNPCC), See TIG, 10 (5), pp 164 to 168, (1994), for a 
review . 

10 

According to the present invention, methods for 
detecting errors in nucleic acid repair mechanisms are 
accomplished as described herein. The correction of the 
mutations found in the mutant genes of the present- invention 
15 is therefore a valuable method for combatting diseases* 

Further methods for combatting diseases caused by at 
least one gene having one or more somatic mutations giving 
rise to a frameshift mutation, include targeting the mutant 
20 gene with an oligonucleotide having a correct nucleotide 
sequence and relying on endogenous or exogenous cellular 
repair enzymes to repair enzymes to repair the mutation, or 
targeting the mRNA transcript transcribed from the mutated 
gene, and correcting the mutant mRNA using ribozymes* 

25 

Methods and reagents for disease diagnosis and 
treatment are described in more detail hereinbelow. 

30 Diagnosis of Diseases According to the Invention 

The invention relates to methods for diagnosing 
diseases caused by or associated with at least one gene 
having one or more somatic mutations which give rise to a 
35 frameshift mutation* Such diseases include but are not 
limited to cancers and neurodegenerative diseases such as 
Parkinson's Disease (PD) , Alzheimer's Disease (AD), frontal 
lobe dementia (Pick's Disease), progressive supranuclear 
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palsy (PSP) , amyotrophic lateral sclerosis, Huntington's 
Disease, multiple sclerosis, Down's syndrome, and ther 
degenerative diseases such as cardiovascular diseases and 
rheumatoid arthritis. 

5 

Somatic mutations can result in a different gene 
function and have been implicated in diseases associated 
with ageing, such as certain cancers. However, it has 
generally been assumed that non-proliferating cells do not 

10 undergo important changes at the genomic level. For 
example, it was assumed previously that genomic changes are 
mainly related to cell proliferation (Smith, Mutation 
Research, 277, pp 139-142, 1992) which for non-proliferating 
cells such as most neurons ends during early postnatal life 

15 (Rakic, Science, 277, pp 1054-1056, 1985). However, Evans 
et al., 1994, Proc. Nat. Aca. Sci. 91:6059, observed that 
somatic mutations do occur in genes of the neuronal system, 
i.e., in post-mitotic neurons. The di/di Brattleboro rat, 
which suffers from severe diabetes insipidus due to the 

20 absence of the antidiuretic hormone vasopressin (VP) , was 
the subject of the Evans et al. paper. It had previously 
been established that the VP hormone was absent in the 
Brattleboro rat due to a deletion of a single G residue in 
the second exon of the VP gene, resulting in a mutant VP 

25 precursor with an altered C-terminal amino acid sequence. 
It had also been observed that a small number of neurons in 
the di/di rat exhibited a heterozygous +/di phenotype and 
expressed an apparently normal VP gene product. In studying 
the molecular biology of the di/di rat, Evans et al. 

30 identified sequence alterations that restored the reading 
frame of the mutant VP precursor mRNA, which were based on 
a two-nucleotide deletion in a GAGAG motif. They correlated 
the presence of small amounts of normal VP gene product in 
single magnocellular neurons with a reversion of the mutant 

35 gene stemming from a frameshift mutation. Evans et al. 
concluded that, because +1 frameshift mutations are present 
in VP transcripts of both wild-type rats and di/di rats, the 
events leading to these mutations are not caused by the 
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diseased state of the di/di rat per se. Thus, Evans et al. 
did not correlate a mutational GAGAG hotspot with a disease 
state, or predeliction to a disease. 

5 In the present invention, the observations of Evans 

et al., as to reversions at GAGAG hotspots in VP transcripts 
within single neurons of the di/di rat leading to wild-type 
VP gene products, is extended and developed. According to 
the present invention, a human disease which is caused by or 

10 associated with at least one gene having one or more somatic 
mutations occurring at a mutational hotspot and which give 
rise to a frameshift mutation is identified and/or 
diagnosed. The nucleotide sequence of a gene suspected of 
being involved in the pathogenesis of a disease is- provided, 

15 e.g., from published gene sequences or from cloning and 
sequencing of a suspect gene* The amino acid sequence 
encoded by the gene is then predicted, as are amino acid 
sequences of encoded mutant proteins. Mutant protein 
sequences are predicted in +1 and +2 reading frames 

20 following a hypothesized frameshift mutation. The location 
of the frameshift mutation may be hypothesized with respect 
to certain nucleotide sequence motifs which are suspected of 
causing frameshift mutations during transcription, examples 
of such motifs including but not limited to GAGA, for 

25 example, GAGAG, GAG AC , GAGAA, and GAG AT. 

A probe is then prepared that is specific for the 
mutant protein or an immunogenic fragment thereof (such 
probes being described hereinabove for detection of proteins 

30 or protein fragments) . Depending on where the mutation that 
leads to the frameshift occurs, part of the mutant protein 
will have the same sequence as the wild-type protein and 
part of the protein will have the sequence of the mutant 
protein. Furthermore, depending on where the mutation 

3 5 occurs the mutant protein will terminate when the nucleotide 
sequence codes for a stop codon (indicated as * in the 
Figures). Thus, different mutant proteins will be produced 
depending on where the mutation occurs. 
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Alzheimer* s Disease is a representative disea6 
diagnosable and treatable according to the invention. AD is 
a neurodegenerative disease characterised by idiopathic 
5 progressive dementia and is the fourth highest major cause 
of death in developed countries. It affects 5 to 11% of the 
population over the age of 65 and as much as 47% of the 
population over the age of 85. At present there are an 
estimated 4 million patients suffering from AD in the 
10 U.S.A. (see Coleman, Neurobiol. of Ageing, 15, Suppl. 2, pp 
577-578, 1994), and an estimated 20 million Alzhemier* s 
patients worldwide. 

The clinical criteria for AD diagnosis have been 
15 defined (see Reisberg et al., Am. J. Psych. 12, pp 1136- 
1139, 1982; McKhann et al., Neurology, 34, pp 939-944, 
1984) . The early symptoms of AD vary but generally include 
depression, paranoia and anxiety. There is also a slow 
degeneration of intellectual function and memory. In 
20 particular, cognitive dysfunction and specific disturbances 
of speech (aphasia), motor activity (apraxia) , and 
recognition of perception (agnosia) can occur. 

There is not yet general consensus in a test for ante 
25 mortem diagnosis for AD due to the lack of knowledge of the 
pathogenic mechanisms involved in AD. Diagnosis of AD is 
made by examination of brain tissue. Such diagnosis is 
usually carried out on individuals post mortem. The 
diagnosis is based on the presence of a large number of 
30 intraneuronal neurofibrillary tangles and of neurit ic 
plaques in the brain tissue, in particular in the neocortex 
and hippocampus. In order to identify the various types of 
plaques (e.g. neuritic plaques), neuropil threads and 
neurofibrillary tangles, staining and microscopic 
35 examination of several brain tissue sections is necessary. 
Neuritic plaques are believed to be composed of degenerating 
axons (e.g., neuropil threads), nerve terminals and possibly 
astrocytic and microglial elements. It is also often found 
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that neuritic plaques have an amyloid protein core. The 
neurofibrillary tangles comprise normal and paired helical 
filaments and are believed to consist of several proteins. 

5 There are two major types of AD, late onset (>60 

years) and early onset (<60 years) . Approximately 75% of 
all AD cases are late onset and only 25% are early onset. 
Of the latter group 2.4% consists of the familial type of AD 
linked to chromosome 21 and 28% of the cases are considered 
10 to be linked to chromosome 14, as discussed below. In 
addition, a recent linkage to chromosome 1 has been 
established for juvenile onset (0.4%). Sporadic cases are 
the most prominent group (60%). 

15 In the most common late onset group, 10 to 30% of 

cases are considered to be related to chromosome 19, and in 
particular the apolipoprotein-E 4 gene is considered to be an 
important risk factor. The remaining late onset cases are 
non-familial or "sporadic" cases (see Van Broeckhoven et 

20 al. # Europ. Neurol., 3j>, pp 8-19, 1995 and Table 1). For 
these cases relatively little is known and previously no 
data was available which suggested a possible cause of AD. 

At present, it is unclear whether the formation of 
25 neuritic plaques and/or neurofibrillary tangles is directly 
responsible for causing AD. The formation of neuritic 
plaques, neuropil threads and/or neurofibrillary tangles may 
be a consequence of a more fundamental cellular or 
biochemical change. 

30 

Diagnostic methods of the invention will include the 
detection of nucleic acid sequences, preferably via 
procedures which involve formation of a nucleic acid duplex 
between two nucleic acid strands, i.e., a nucleic acid probe 
3 5 and a complementary sequence in DNA from a biological 
sample, or detection of a protein, preferably a mutant or 
hybrid wild-type/nonsense protein, as defined herein. 
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1. Preparation and Detect! n f DHA for enetio Screening. 

Typically, DNA is prepared from the biological sample 
by DNA extraction procedures veil -known in the art (see, 
5 e.g., Sambrook et al., 1990, A Laboratory Manual for 
Cloning, Cold Spring Harbor Press, CSH, NY), and may be 
further purified if desired, e.g., by electro-elution, prior 
to analysis. 

10 Methods of detecting a mutated gene in DNA from a 

biological sample include, but are not limited to the 
following: (1) polymerase chain reaction (PCR) followed by 
sizing gel electrophoresis or hybridization with an allele- 
specific (or seguence-specif ic) probe; (2) hybridization of 

15 the eluted DNA with a nucleic acid probe that is 
complementary to the mutated gene; (3) allele-specif ic 
oligonucleotide (ASO) PCR followed by an amplification- 
detection system (e.g., gel electrophoresis and staining or 
HPLC) ; (4) the ARMS test, in which one primer has a 

20 complementary sequence encompassing the mutation which gives 
rise to the frameshift mutation, and amplification only 
occurs if the mutated sequence is present. ; and (5) 
nucleotide sequencing. 

25 A DNA probe useful according to the invention is 

preferably sufficiently complementary to the mutant sequence 
of the gene so that under stringent conditions the probe 
only remains bound to the mutant sequence (see Evans et al., 
Proc. Natl. Acad. Sci. USA, 91:6059-6063, (1994). The probe 

30 is preferably labelled using any of the standard techniques 
known to those skilled in the art, such as radioactively 
using **P or any other standard isotopes, or using non- 
radioactive methods including biotin or DIG labelling. The 
labelled probe can then be easily detected by methods well 

35 known to those skilled in the art. 



An alternative method for detecting the presence of 
the mutant gene is via the polymerase chain reaction (PCR) . 
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Primers having a sequence complementary to the sequence 
either side of the mutation which gives rise t the 
frameshift mutation are used to amplify the DNA or RNA (if 
RNA is being detected a reverse transcriptase stage must be 
5 performed, as would be apparent to one skilled on the art) 
containing the mutation. The mutation in the amplified 
fragment can then be detected using the probe described 
above using standard techniques or by sequencing the 
amplified fragment. The advantages of using the PCR 

10 reaction is that the actual mutated sequence is obtained, 
less starting material is required and the PCR methods allow 
quantitative as well as qualitative determinations to be 
made. Quantitative determinations allow the number of 
copies of a mutated gene present in a particular - sample to 

15 be estimated, and given this information the severity of 
the diseased state can be estimated. 

Another alternative method for detecting the presence 
of the mutant gene is one in which one primer has a 

20 complementary sequence encompassing the mutation which gives 
rise to the frameshift mutation. Amplification will 
therefore only occur if the mutated sequence is present. 
Newton et al., Nucl. Acids. Res* 17:2503, 1989. The method 
has previously been used in detecting mutations in the gene 

25 responsible for cystic fibrosis, and one skilled in the art 
could easily perform this test for the detection of the 
mutant gene of the present invention. 

An example of analysis method (1) follows. The DNA 
30 is amplified, e.g., using PCR, prior to analysis. Specific 
conditions for any one PCR, i.e. a PCR targeting a 
particular sequence, or for any one multiplex PCR, i.e. a 
PCR targeting a particular set of sequences, may vary but 
will be known to a person of ordinary skill in the art. 

35 

Amplification of a mutated or wild-type nucleic acid 
sequence can be accomplished directly from an aliquot of the 
prepared DNA as follows. 
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25 Ml of DNA is aliquotted int a reaction tube 
containing 25 m! H 2 0, 50 jxl master mix (see below), 0.5 /xl 
Amplitaq (Perkin Elmer Cetus, Norwalk, CT) and 0.5 /xl UNG 
(Perkin Elmer Cetus, Norvalk, CT) . A 50 Ml master mix 
comprises 20 mM Tris HC1, pH 8.3, 100 mM KC1, 5 mM MgCl 2 , 
0.02 /moles each of dATP, dGTP, dCTP, 0.04 M»oles of dOTP, 
20 pmoles of each primer (Perkin Elmer Cetus, Norwalk, CT) , 
and 25 /ig gelatin. 

A fragment characteristic of the selected 
amplification sequence can then be visualized under 
ultraviolet light after ethidium bromide staining a 13% 
polyacrylamide gel in which an aliquot of the amplification 
has been electrophoresed. Alternatively, hybridization with 
allele-specific probes can identify the presence of 
amplified product from either the normal and/ or mutant 
alleles. 
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2. Preparation and Detecti n f Protein f r Gen tic 
Screening. 

Where the biological molecule to be analyzed is a 
5 protein, it may be desirable to release the nucleic acid 
from biological sample cells prior to protein elution, or to 
remove nucleic acid from the sample eluate prior to protein 
analysis. Thus, the sample or eluate may first be treated 
to release or remove the nucleic acid by mechanical 

10 disruption (such as freeze/thaw, abrasion, sonication) , 
physical/chemical disruption, such as treatment with 
detergents (e.g., Triton, Tween, or sodium dodecylsulf ate) , 
osmotic shock, heat, enzymatic lysis (lysozyme, proteinase 
K, pepsin, etc.) ^ or nuclease treatment, all according to 

15 conventional methods well known in the art. 

Where a biological sample includes a mutant protein, 
the presence or absence of which is indicative of a genetic 
disease, the protein may be detected using conventional 

20 detection assays, e.g., using protein-specific probes such 
as an antibody probe. Similarly, where a genetic disease 
correlates with the presence or absence of an amino acid or 
sequence of amino acids, these amino acids may be detected 
using conventional means, e.g., an antibody which is 

25 specific for the native or mutant sequence (see Table 7 for 
examples of amino acid sequences present in mutant 
proteins) . 

Any of the antibody reagents useful in the method of 
30 the present invention may comprise whole antibodies, 
antibody fragments, polyfunctional antibody aggregates, or 
in general any substance comprising one or more specific 
binding sites from an antibody. The antibody fragments may 
be fragments such as Fv, Fab and F(ab ! ) 2 fragments or any 
35 derivatives thereof, such as a single chain Fv fragments. 
The antibodies or antibody fragments may be non- 
recombinant, recombinant or humanized- The antibody may be 
of any immunoglobulin isotype, e.g., IgG, IgM, and so forth. 
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In addition, aggregates, polymers, derivativ s and 
conjugates of immunoglobulins or their fragments can be used 
where appropriate. 

5 The immunoglobulin source for an antibody reagent can 

be obtained in any manner such as by preparation of a 
conventional polyclonal antiserum or by preparation of a 
monoclonal or a chimeric antibody. Antiserum can be 
obtained by well-established techniques involving 
10 immunization of an animal, such as a mouse, rabbit, guinea 
pig or goat, with an appropriate immunogen. 
Preparation of Antibodies 

1. Polyclonal antibodies. 

15 

The peptide or polypeptide may be conjugated to a 
conventional carrier in order to increases its 
immunogenic ity, and antisera to the peptide-carrier 
conjugate is raised in rabbits. Coupling of a peptide to a 

20 carrier protein and immunizations are performed as described 
(Dymecki, S.M., et al., J. Biol. Chem 267:4815-4823, 1992). 
Rabbit antibodies against this peptide are raised and the 
sera titered against peptide antigen by ELISA or 
alternatively by dot or spot blotting (Boersma and Van 

25 Leeuwen, 1994, Jour. Neurosci. Methods 51:317. At the same 
time, the antisera may be used in tissue sections. The sera 
is shown to react strongly with the appropriate peptides by 
ELISA, following the procedures of Green et al., Cell, 28, 
477-487 (1982). The sera exhibiting the highest titer is 

30 used in subsequent experiments. 

2. Monoclonal antibodies. 

Techniques for preparing monoclonal antibodies are 
3 5 well known, and monoclonal antibodies of this invention may 
be prepared using a synthetic peptide, preferably bound to 
a carrier, as described by Arnheiter et al., Nature, 294, 
278-280 (1981). 
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Monoclonal antibodies are typically obtained fr m 
hybridoma tissue cultures or from ascites fluid obtained 
from animals into which the hybridoma tissue was introduced. 
5 Nevertheless, monoclonal antibodies may be described as 
being "raised to H or "induced by" the synthetic peptides or 
their conjugates. 

Particularly preferred immunological tests rely on the 

10 use of either monoclonal or polyclonal antibodies and 
include enzyme linked immunoassays (ELISA) , immunoblotting, 
immunoprecipitation and radioimmunoassays. See Voller, A., 
Diagnostic Horizons 2:1-7, 1978, Microbiological Associates 
Quarterly Publication, Walkersville, MD; Voller, A. et al., 

15 J. Clin. Pathol. 31:507-520 (1978); U.S. Reissue Pat. No. 
31,006; UK Patent 2,019,408; Butler, J.E., Meth. Enzymol. 
73:482-523 (1981) ; Maggio, E. (ed.), Enzyme Immunoassay, CRC 
Press, Boca Raton, FL, 1980) or radioimmunoassays (RIA) 
(Weintraub, B., Principles of radioimmunoassays . Seventh 

20 Training Course on Radioligand Assay Techniques, The 
Endocrine Society, March 1986, pp. 1-5, 46-49 and 68-78). 
For analyzing tissues for the presence of the mutant protein 
of the present invention, immunohistocheroistry techniques 
are preferably used. It will be apparent to one skilled in 

25 the art that the antibody molecule will have to labelled to 
facilitate easy detection of mutant protein. Techniques for 
labelling antibody molecules are well known to those skilled 
in the art (see Harlour and Lane, Antibodies, Cold Spring 
Harbour Laboratory, pp 1-726, 1989). 

30 

Alternatively, sandwich hybridization techniques may 
be used, e.g., an antibody specific for a given protein. In 
addition, an antibody specific for a haptenic group 
conjugated to the binding protein can be used. Another 
3 5 sandwich detection system useful for detection is the avidin 
or streptavidin system, where a protein specific for the 
detectable protein has been modified by addition of biotin* 
In yet another embodiment, the antibody may be replaced with 
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a non-immunogl bulin protein which has the pr perty of 
binding to an immunoglobulin molecule, for example 
Staphylococcal protein A or Streptococcal protein G, which 
are well-known in the art. The protein may either itself be 
5 detectable labeled or may be detected indirectly by a 
detectable labeled secondary binding protein, for example, 
a second antibody specific for the first antibody. Thus, 
if a rabbit-anti-hybrid wild-type/nonsense protein antibody 
serves as the first binding protein, a labeled goat-ant i- 
10 rabbit immunoglobulin antibody would be a second binding 
protein. 

In another embodiment, the signal generated by the 
presence of the hybrid wild-type/nonsense protein is 

15 amplified by reaction with a specific antibody for that 
fusion protein (e.g., an anti-6-galactosidase antibody) 
which is detectably labeled. One of ordinary skill in the 
art can devise without undue experimentation a number of 
such possible first and second binding protein systems using 

20 conventional methods well-known in the art. 

Alternatively, other techniques can be used to detect 
the mutant proteins, including chromatographic methods such 
as SDS PAGE, isoelectric focusing, Western blotting, HPLC 
25 and capillary electrophoresis. 

Identification of Diseases According to the Invention 

The invention provides methods for identifying 
30 diseases caused by or associated with at least one gene 
having one or more somatic mutations which give rise to a 
frameshift mutation. 

Diseases are identified according to the invention as 
35 follows. The nucleotide sequence of a gene suspected of 
being involved in the pathogenesis of a disease is provided, 
e.g., from published gene sequences or from cloning and 
sequencing of a suspect gene. The amino acid sequence 
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encoded by the gene is then predicted, as are amino acid 
sequences of encoded mutant proteins. Mutant protein 
sequences are predicted in +1 and +2 reading frames 
following a hypothesized frameshift mutation. The location 
5 of the frameshift mutation may be hypothesized with respect 
to certain nucleotide sequence motifs , examples of such 
motifs including but not limited to GAGA, for example, 
GAG AG , GAG AC, GAGAA, and GAG AT. 

10 A probe is then prepared that is specific for the 

mutant protein or an immunogenic fragment thereof (such 
probes being described hereinabove for detection of proteins 
or protein fragments) . Depending on where the mutation that 
leads to the frameshift occurs, part of the mutant protein 

15 will have the same sequence as the wild-type protein and 
part of the protein will have the sequence of the mutant 
protein. Furthermore, depending on where the mutation 
occurs the mutant protein will terminate when the nucleotide 
sequence codes for a stop codon (indicated as * in the 

20 Figures). Thus, different mutant proteins will be produced 
depending on where the mutation occurs. 

The simplest method of probing for the presence of a 
particular mutant protein is to make an antibody to that 

25 protein or an immunogenic portion thereof. An immunogenic 
fragment may be synthesized corresponding to the C-terminus 
of the predicted mutant proteins because even if the 
mutation occurred at another position in the sequence the 
probability that the derived mutant protein contains the 

30 peptide sequence is increased. Furthermore, the Oterminal 
region of a protein is more likely to form an epitope than 
other regions of the protein. 

Once a probe is made, a biological sample from a 
3 5 patient having the disease and a biological sample from a 
patient not having the disease is probed for the presence or 
absence of the mutant protein, also as described above. 
Alternatively, several probes may be prepared and the 
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combination of probes used to probe the tissue sample. The 
presence of the mutant protein in a biological sample from 
a patient having the disease and the absence of said mutant 
protein in a biological sample from a patient not having the 
5 disease indicates that the mutant protein is a marker for 
the disease or susceptibility to the disease. 

Treatment of Diseases According to the Invention 

10 The invention also relates to methods for preventing 

and/or treating diseases, vectors for preventing and/or 
treating the diseases, and compositions such as nucleic acid 
sequences and proteins for preventing and/or treating the 
diseases, which methods and compositions are useful in gene 

15 and protein therapies. 

The invention includes methods of treatment and/or 
prevention of a disease caused by or associated with a gene 
having a somatic mutation giving rise to a frameshift 

20 mutation in which a vector comprising an expressible gene 
encoding a repair enzyme or a ribozyme is administered to a 
patient suffering from or susceptible to the 
disease. Preferred diseases which are treated according 
to the invention include but are not limited to cancer or a 

25 neurodegenerative disease, especially AD, the preferred 
mutant genes of the present invention are those encoding the 
B amyloid precursor protein, the Tau protein, ubiquitin, 
apolipoprotein-E A (Apo-E 4 ) , microtubule associated protein 
II (MAP 2) and the neurofilament proteins, having a 

30 deletion, insertion or other modification leading to a 
frameshift mutation. 

Examples of genes encoding such repair enzymes include 
but are not limited to genes encoding MSH 1-6, PMS 1-2, MLH 
35 1 or GTBP. 



The invention includes methods of treatment and/or 
prevention of a disease in which a vector comprising an 
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expressible g ne encoding the wild-typ version of the 
mutated gene is administered to a patient suffering from r 
susceptible to the disease. Examples of genes encoding a 
wild -type version of a mutated gene include but are not 
5 limited to those disclosed herein and sequences disclosed in 
the figures. 

A vector comprising an expressible gene encoding a 
repair enzyme (that is, the wild-type version of a defective 
10 repair enzyme which has lead to or contributed to the 
presence of the mutated gene which causes or is associated 
with a disease) , a ribozyme, or a wild-type version of a 
mutated gene are characterized as described below. 

15 The vectors of the present invention are preferably 

nucleic acid vectors comprising RNA or DNA. The vectors may 
be of linear or circular configuration and may be adapted 
for episomal or integrated existence in the host cell, as 
set out in the extensive body of literature known to those 

20 skilled in the art. The vectors may be delivered to cells 
using viral or non-viral delivery systems. The choice of 
delivery system will depend on whether the nucleic acid 
sequence to be delivered is to be incorporated into the cell 
genome or is to remain episomal. 

25 

Vectors of the present invention additionally may 
comprise further control sequences such as enhancers or 
locus control regions (LCRs) , in order to lead to more 
controlled expression of the encoded gene or genes. LCRs 

30 are described in EP-A-0332667. The inclusion of a locus 
control region (LCR) , is particularly preferred as it 
ensures the DNA is inserted in an open state at the site of 
integration, thereby allowing expression of the gene or 
genes contained in the vector* The vectors of the present 

35 invention have wide range of applications in ex vivo and in 
vivo gene th rapy. 
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The invention also includes treatment of a disease by 
administrati n of a pharmaceutical c mpositi n comprising 
the wild-type analog of a mutant protein in admixture with 
a pharmaceutically acceptable carrier. 

5 

A pharmaceutical composition according to the 
invention will include a therapeutically effective amount of 
the wild-type analog of the mutant protein, a repair enzyme, 
or a ribozyme, in admixture with a carrier. A 

10 therapeutically effective amount is considered that amount 
which, when administered to a patient, provides a 
therapeutic benefit to the patient. Such amounts will 
generally be in the range of lOug-lOOmg of therapeutic 
protein/kg body weight of the patient, preferably 50ug- 

15 lOmg, and most preferably lOOug-lmg. 



Animal Models for Disease Diagnosis and Treatment According 
20 to the invention 

The invention also includes cell lines and transgenic 
animals for use as disease models for testing or treatment. 

25 A cell line or transgenic animal according to the 

invention will contain a recombinant gene, also known herein 
as a transgene, having one or more somatic mutations giving 
rise to a frameshif t mutation which causes or is associated 
with a disease. 

30 

The recombinant gene will encode a mutated protein 
found to be indicative of a disease. Preferably, the mutant 
protein will contain an antigenic epitope specific for the 
diseased state. The recombinant gene may encode a protein 
35 comprising at least part of the sequence designated +1 or +2 
in any one of Figures 2 to 9, or an immunologically 
equivalent fragment thereof. 
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A cell line containing a transg ne encoding a mutant 
protein, as describ d h rein, is mad by introducing the 
transgene into a selected cell line according to any one of 
several procedures known in the art for introducing a 
5 foreign gene into a cell. 

A transgenic animal containing such a transgene 
includes a rodent, such as a rat or mouse, or other mammals, 
such as a goat, a cow, etc. and may be made according to 
10 procedures well-known in the art. 

Transgenic animals are useful according to the 
invention as disease models for the purposes of research 
into diseases caused by or associated with at least one gene 

15 having one or more somatic mutations giving rise to a 
frameshift mutation, and therapies therefore. By 
specifically expressing one or more mutant genes, as defined 
above, the effect of such mutations on the development of 
the disease can be studied. Furthermore, therapies 

20 including gene therapy and various drugs can be tested on 
the transgenic animals. 

Transgenic animals of the present invention may be 
additionally defective in nucleic acid repair mechanisms. 
25 Thus, the mutated gene specifically expressed in the 
transgenic animal will not be repaired. 
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Described below is an embodiment of the invention 
involving identification of somatic frameshift mutations in 
5 genes encoding proteins which are present in neuronal 
tissue, and how such mutations are useful in diagnosis of 
certain disease states. 

The DNA sequences coding for the human amyloid A4 
10 protein, Tau, ubiquitin, apolipoprotein E4, MAP 2 and the 
neurofilament subunits low, medium and high, were obtained 
from various gene sequence databases. 

Using the sequence data, the various GAGA -motifs in 
15 the sequences were identified, and deletions were 
hypothesized and the sequences of the derived mutant 
proteins predicted, as shown in Figures 2 to 9. Both the 
sequences of the +1 and +2 frameshift mutant proteins were 
predicted. 

20 

By examining the sequences of the hypothesized mutant 
proteins, a peptide corresponding to the C-terminus of the 
hypothesized mutant proteins was synthesized. The peptides 
were synthesized using standard techniques known to those 
25 skilled in the art. The peptides having the following 
sequences were synthesized: RGRTSSKELA; HGRLAPARHAS ; 
YADLREDPDRQ; RQDHHPGSGAQ; GAPRLPPAQAA; KTRFQRKGPS; 
PGNRSMGHE; EAEGGSRS; and VGAARDSRAA. 

30 Depending on where the mutation that leads to the 

frameshift occurs, part of the mutant protein will have the 
same sequence as the wild-type protein and part of the 
protein will have the sequence of the mutant protein. 
Furthermore, depending on where the mutation occurs the 

35 mutant protein will terminate when the nucleotide sequence 
codes for a stop codon (indicated as * in the Figures). 
Thus different mutant proteins will be produced depending on 
where th mutation occurs. 
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It is predict d that mutati ns will ccur at GAGA 
motifs and the sequences of the mutant proteins predicted 
accordingly, 

5 

Peptides were synthesized corresponding to the O 
terminus of the predicted mutant proteins because even if 
the mutation occurred at another position in the sequence 
the probability that the derived mutant protein contains the 
10 peptide sequence is increased. Furthermore, the C-terminus 
region of a protein is more likely to form an epitope than 
other regions of the protein. 

The uniqueness of the synthesized peptides was 
15 confirmed by a gene sequence database search. 

Each synthesized peptide was then injected into a 
rabbit and an antibody having affinity for the peptide 
purified. The techniques used to obtain the antibodies are 
20 standard techniques known to those skilled in the art. 

The antibodies obtained were then tested on autopsy 
material of frontal cortex , temporal lobe and hippocampus of 
neuropathologically confirmed AD cases and control non-AD 
25 cases. The presence of the antibodies is determined using 
standard detection methods known to those skilled in the 
art. 

Figure 1 shows the presence of the B amyloid precursor 
30 mutant protein in the frontal cortex of an Alzheimer patient 
identified using an antibody against a peptide predicted by 
the +1 reading frame of B amyloid precursor protein. The 
antibody used had affinity for a peptide having the 
following sequence RGRTSSKELA. 

35 

The results of other immunoreactive tests perform d 
using the antibodies against the predict d peptides are 
shown in Table 2 and 3. Tables 2 and 3 also indicates the 
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neuropathological state (pr sence of plagues/ tang 1 s) of the 
various tissue samples* 

It can be seen that the presence of the mutant protein 
5 can be detected and correlates with the subject having AD. 
The presence of one or more of the mutant proteins can 
therefore be seen to be indicative of AD* 

Table 4 summaries the immunoreactivity results within 
10 the frontal cortex (area 11) , temporal cortex (area 38) and 
the hippocampus. 

Other diseases also may be correlated with the 
presence of mutant proteins, as defined herein. For 

15 example, six patients with Down* s syndrome were tested 
according to the invention. Down* s syndrome is trisomy of 
chromosome 21 which leads to over-expression of B-amyloid 
precursor protein. We noted that the frontal and temporal 
cerebral cortex and hippocampus of these patients contained 

20 plaques and neurofibrillary tangles, and hypothesized that 
such over-expression may promote accumulation of somatic 
mutations in neurons, by frameshift mutations at a GAGAG 
motif in the over-expressed 6-amyloid gene. After 
immunocytochemical staining of tissue from frontal and 

25 temporal cerebral cortex from the Down* s patients with the 
above-described antibody specific for the amyloid +1 carboxy 
terminal peptide, immunoreactivity was observed in the 
neurofibrillary tangles in 4 of 6 patients. Staining was 
absent in the frontal cortex of the matched controls. 

30 Therefore, the mutant amyloid protein is correlated not only 
with Alzheimer 1 s disease, but also with other diseases, such 
as Down* s, involving Alzheimer* s neuropathology. 

It has been found that a number of the mutations occur 
35 at GAGA motifs. Table 5 shows the presence of GAGA motifs 
in various genes of the neuronal system. The motif or, as 
can be seen from the sequences of Tau and apolipoprotein £4, 
similar motifs such as GAGAG GAGAC , GAGAA, and GAGAT may be 



SUBSTITUTE SHEET (RULE 26) 



WO 97/12992 



42 



PCI7IB96/01106 



associated with the f rameshif t mutations that lead to or are 
associat d with th diseas . The presence of the motif or 
similar motifs in other genes may indicate that they are 
relevant to a disease. It is also possible that other 
5 mutations occur that are not associated with such motifs but 
still lead to f rameshif t mutations that cause or are 
associated with a disease. 

Table 6 shows the presence of GAGAG motifs in 
10 particular genes of the neuronal system, namely fiAPP, Tau 
and Ubiquitin. This table also indicates, inter alia, the 
chromosomal location of the genes and the molecular weight 
of the longest polypeptide forms encoded by the genes and 
the predicted size of the aberrant +1 peptide with its C- 
15 terminus against which the antibodies were raised. These 
peptides were revealed in a Western blot and also identified 
with a different antibody recognizing an epitope on the 
unaffected wild-type N-terminus. 

20 

Example ? 

Selection of Antigenic Peptide 

25 Synthetic polypeptides corresponding in sequence to 

a portion of a mutant protein (whether such peptides are 
chemically synthesized or are chemically or recombinantly- 
generated fragments of a protein) , as described herein, will 
be useful according to the invention as antigenic peptides 

30 for generation of antibodies specific for a mutant protein, 
provided they possess the following characteristics. The 
synthetic peptide will include a minimum of 12 and 
preferably 15 amino acid residues, and an optimum length of 
20-21 amino acids. The hydrophilicity and antigenic index 

35 of the amino acid sequence of the hybrid wild-type /nonsense 
protein may be determined by Analytical Biotechnology 
Sciences, Boston, MA, using computer programming. Potential 
synthetic peptides useful according to the invention include 
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a stretch of 12-20 amino acids preferably within the carboxy 
terminal 100-150 amino acids of the hybrid wild- 
type/nonsense protein. 

5 The amino acid sequence of a selected peptide is 

searched in a computer database of sequences (e.g., GenBank) 
to preclude the possibility that at reasonable 
concentrations, antisera to any one peptide would 
specifically interact with any protein of a known sequence. 
10 Preferred sequences are those which are determined not to 
have a close homolog (i.e., •"close" meaning 80-100% 
identity) . 

15 

Detection of "Mutant" Protein 

20 Another embodiment of this invention relates to an 

assay for the presence of the "mutant" or mutant protein in 
a given tissue as indicative of a disease state. Here, an 
above-described antibody is prepared. The antibody or 
idiotype-containing polyamide portion thereof is then 

25 admixed with candidate tissue and an indicating group. The 
presence of the naturally occurring amino acid sequence is 
ascertained by the formation of an immune reaction, as 
signaled by the indicating group. Candidate tissues include 
any tissue or cell line or bodily fluid to be tested for the 

30 presence of the mutant protein, as described hereinabove. 

Expression of a given hybrid wild-type/ nonsense 
protein may be investigated using antiserum prepared in 
rabbits against a peptide corresponding to a carboxy 
3 5 terminal stretch of amino acids in the hybrid wild- 
type/ nonsense protein as follows. 

CMK cells or U3T3 cells are metabolically labeled with 
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35 S-methionine and extracts are immunoprecipitated with 
antiserum. If the hybrid wild- type/ nonsense protein is 
pres nt in the cells, then a protein species of 
corresponding molecular weight will be detected in GMK and 
5 U3T3 cells. The protein may be localized to the membrane, 
nucleus or cytoplasm by Western blot analysis of the 
nuclear, membrane and cytoplasmic fractions, as generally 
described in Towbin et al., Proc. Natl. Acad. Sci. USA, 76, 
4350-4354 (1979). This localization may be confirmed by 
10 immunofluorescence analysis to be associated mainly with the 
plasma membrane. 

Metabolic labeling immunoprecipitation, and 
immunolocalization assays are performed as described 

15 previously (Furth, M.E., et al., Oncogene 1:47-58, 1987; 
Laemmli, U.K., Nature 227:680-685, 1970; Yarden, Y. r et al., 
EMBO J. 6:3341-3351, 1987; Konopka, J.B., et al., Mol. Cell. 
Biol. 5:3116-3123, 1985). For immunoblot analysis, total 
lysates are prepared (using Fruth»s lysis buffer) (Fruth, 

20 M.E. , et al*, Oncogene, 1:47-58, 1987). Relative protein 
concentrations are determined with a colorimetric assay kit 
(Bio-Rad) with bovine serum albumin as the standard. A 
protein of lysate containing approximately 0.05 mg of 
protein was mixed with an equal volume of 2 x SDS sample 

25 buffer containing 2 mercaptoethanol , boiled for 5 min., 
fractioned on 10% polyacrylamide-SD5 gels (Konopka, J.B., et 
al., J.Virol., 51:223-232, 1984) and transferred to 
immunobilon polyvinyldine difluoride (Millipore Corp., 
Bedford, MA) filters. Protein blots were treated with 

30 specific antipeptide antibodies (see below) . Primary 
binding of the specific antibodies may be detected using 
anti-lgG second antibodies conjugated to horseradish 
peroxidase and subsequent chemi luminescence development ECL 
Western blotting system (Amersham International) . 

35 For metabolic labeling, 10 6 cells are labeled with 100 

MCi of 35 S-methionine in 1 ml of Dulbecco^ modified Eagles 
medium minus methionine (Amersham Corp.) for 16h. 
Immunoprecipitation of protein from labeled cells with 
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antipeptide antiserum is performed as described (Dymecki, 
S.M. , et al., J. Biol. Chem 267:4815-4823, 1992). Portions 
of lysates containing 10 7 cpm of acid-insoluble 35 S- 
methionine were incubated with 1 pg of the antiserum in 0.5 
5 ml of reaction mixture. Immunoprecipitation samples were 
analyzed by SDS-polylarcylamide gel electrophoresis and 
autoradiography • 

For immunolocalization studies, 10 7 GMK cells are 
10 resuspended in 1 ml of sonication buffer (60mM Tris-HCl, pH 
7.5, 6 mM EDTA, 15 mM EGTA, 0.75M sucrose, 0.03% leupeptin 
12mM phenylmethylsulfonyl fluoride, 30 mM 2- 
mercaptoethanol) . Cells are sonicated 6 times for 10 
seconds each and centrifuged at 25 , 000 xg for 10 min at 4°C. 
15 The pellet is dissolved in 1 ml of sonication buffer and 
centrifuged at 25,000 x g for 10 min at 4°C. 

The pellet (nucleus fraction) is resuspended in 1 ml 
of sonication buffer and added to an equal volume of 2 x SDS 

20 sample buffer. The supernatant obtained above (after the 
first sonication) is again centrifuged at 100,000 x g for 40 
min at 4°C. The supernatant (cytosolic fraction) is removed 
and added to an equal volume of 2 x concentrated SDS sample 
buffer. The remaining pellet (membrane fraction) is washed 

25 and dissolved in sonication buffer and SOS sample buffer as 
described above. Protein samples are analyzed by 
electrophoresis on 10% polyacrylamide gels, according to the 
Laemmli method (Konopka, J.B., et al., Hoi. Cell. Biol. 
5:3116-3123, 1985). The proteins are transferred from the 

30 gels on a 0.45-pm polyvinylidine difluoride membrane for 
subsequent immunoblot analysis. Primary binding of 
antibodies is detected using anti-IgG second antibodies 
conjugated to horseradish peroxidase. 

35 For immunohistochemical localization of a given 

protein, if desired, CMK cells or U3T3 are grown on cover 
slips to approximately 50% confluence and are washed with 
PBS (pH 7.4) after removing the medium* The cells are 
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prefixed for l min at 37 c c in 1% paraformaldehyde containing 
0,075% Triton X-100, rinsed with PBS and then fix d for 10 
min with 4% paraformaldehyde. After the fixation step, 
cells are rinsed in PBS, quenched in PBS with 0.1 and 
5 finally rinsed again in PBS. For antibody staining, the 
cells are first blocked with a blocking solution (3% bovine 
serum albumin in PBS) and incubated for 1 h at 37 # C, The 
cells are then incubated for 1 h at 37 °C with antiserum 
(1:100 dilution or with pre immune rabbit serum (1:100) (see 
10 below) . After the incubation with the primary antibody, 
the cells are washed in PBS containing 3% bovine and serum 
albumin and 0.1% Tween 20 and incubated for 1 h at 37 C in 
f luorescein-conjugated donkey anti-rabbit IgGs (Jackson 
Immunoresearch, Maine) diluted 1:100 in blocking solution. 

15 

The coverslips are washed in PBS (pH 8.0), and 
glycerol is added to each coverslip before mounting on glass 
slides and sealing with clear nail polish. All glass slides 
were examined with a Zeiss Axiophot microscope. 

20 
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Example 4 

Biological Sample Analysis 

5 The above methods for detection of a given mutant 

protein or nucleic acid are applicable to analyses involving 
tissues, cell lines and bodily fluids suspected of 
containing the marker protein* 

10 For example, a sample of CNS tissue suspected of being 

in a diseased state may be analyzed, it having been 
previously observed according to the invention that tissue 
of that particular diseased state contains detectable levels 
of hybrid wild-type/ nonsense proteins relative to healthy 

15 tissue. 

An aliquot of the suspect sample and a healthy control 
sample are provided and admixed with an effective amount of 
an antibody specific for the hybrid wild-type/nonsense 

20 protein, as herein described, and an indicating group. The 
admixture is typically incubated, as is known, for a time 
sufficient to permit an immune reaction to occur. The 
incubated admixture is then assayed for the presence of an 
immune reaction as indicated by the indicating group. The 

25 relative levels of the hybrid wild-type/nonsense protein in 
the suspect sample and the control sample are then compared, 
allowing for diagnosis of a diseased or healthy state in the 
suspect sample. 

30 The above types of analyzing for the presence of the 

hybrid wild-type/nonsense protein may, of course, be 
performed using analysis for the coding mRNA, e.g., via 
Northern blot or RNA dot blot analysis, both of which are 
conventional and known in the art. 

35 
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Disease Treatment According to the Invention 

Once a gene containing a frameshift mutation (i.e., 
a frameshifted gene) , or a mutant protein is correlated with 
5 a disease state, the disease is treatable according to the 
invention as follows: by administering to a patient in need 
thereof a wild-type version of a mutant gene or the 
corresponding wild-type transcript; by administering the 
wild-type version of the hybrid wild-type/nonsense protein; 

10 by administering enzymes, or sequences encoding such 
enzymes, having activity that operates to correct nucleotide 
insertions or deletions (DNA repair enzymes) ; by 
administering enzymes which serve to correct frameshifted 
RNA via splicing, e.g., ribozymes; or by administering 

15 oligonucleotides or sequences encoding oligonucleotides to 
a cell as templates for repair of an insertion or deletion 
mutation. A patient in need thereof will include a patient 
exhibiting symptoms of the disease, even those patients 
suspected of developing the disease, i.e. , who are monitored 

20 according to the invention by measuring the a tissue sample, 
e.g., the cerebrospinal fluid, for the presence of 
frameshifted peptides (e.g. peptides having an amino acid 
sequence in the +1 or +2 reading frame) . 

25 Therefore, it is contemplated according to the 

invention that the wild type version of the mutant (i.e., 
frameshifted) gene, or its encoded wild type protein, may be 
administered to the patient in order to treat the disease 
associated with the presence of the mutant gene or its 

30 encoded protein. 

According to the invention, a vector encoding a non- 
mutated version of a frameshifted gene under the control of 
a promoter can be delivered to affected or susceptible cells 
35 leading to the production of the correct protein in the 
cell. Without being bound by any one the ry, it is 
suggested that increasing the percentage of the c rrect 
protein produced in relation to the hybrid wild- 
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type/nonsense protein will reduc or prevent furth r 
progression of the disease, and possibly reverse the 
diseased state. For example, in neurons having undergone a 
frameshift nutation in one of two expressed alleles, the 
5 balance between the wild-type and mutant transcripts may be 
shifted in favor of the wild-type one as a result of the 
treatments described herein. In addition, it is possible 
that not every mutant gene and transcript results in a 
mutant protein that is directly toxic to the neuronal 

10 tissue. For example, the mutant protein may be routed to 
the lysosomal system or just secreted (e.g. by the 
constitutive or regulated pathway) and degraded elsewhere. 
However, sometimes the mutant protein will be accumulated 
in the membranes of organelles, for instance in the 

15 endoplasmic reticulum, thus disrupting the normal processes 
of the cellular machinery. This may be especially true if 
both alleles of a given gene or if different genes are 
mutated • 

20 The wild-type version of the mutated gene encodes the 

correct protein. The wild-type gene corresponds to the 
mutated gene found present in the affected cells and 
preferably the wild-type gene is expressed at high levels 
leading to the production of more wild-type protein than the 

25 mutated protein. When the disease is a neurodegenerative 
disease, preferred wild-type genes include the B amyloid 
protein gene, the Tau gene, the ubiquitin gene, the 
apolipoprotein-E^ gene, the microtubule associated protein 
II (MAP2) gene and the neurofilament protein genes. The 

30 sequences of these genes are provided herein in the figures. 
Other preferred wild-type genes include the alpha and beta 
tubulin genes, the sequences of which are found in Cowan et 
al. r Mol. and Cellular Biology, 3, 1738-1745(1983) and Lewis 
et al., J. Mol. Biol. 182, 11-20(1985), repectively. 

35 

When the disease is a non-hereditary cancer, preferred 
wild-type gene sequences include but are not limited to the 
human p53 gene and the bcl-2 gene. Mammalian phosphoprotein 
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p53 has be n shown to play an ssential role in regulati n 
of cell division and is required for the transition from 
phase GO to Gl of the cell cycle. P53 is normally present 
in very low levels in normal cells and is believed to be a 
5 tumor suppressor gene; when present at high levels, p53 has 
been shown to play a role in transformation and malignancy. 
P53 gene alleles from normal and malignant tissues have been 
shown to contain Bglll site polymorphism (Buchman et al., 
1988, Gene 70:245). The p53 coding region contains several 

10 GAGA motifs, e.g., GAGAC at position 1476 of the sequence 
published in Buchman et al. r GAGA at position 1498; GAGA at 
position 1643; and GAGA at position 1713, which motifs 
present candidate sites for frameshift mutations according 
to the invention. A frameshift mutation within the p53 gene 

15 thus may lead to loss of the natural p53 tumor suppressor 
function. Detection of such a mutation in p53 may be 
diagnostic of pre-malignancy or malignancy, and treatment 
as described herein which results in correction of p53 
function may restore tumor suppressor function. 

20 

It is also contemplated according to the invention 
that a mutant gene, which mutation comprises a somatic 
mutation giving rise to a frameshift mutation that includes 
a nucleotide deletion or insertion (i.e., of one or more 

25 nucleotides) , may be due to a deficit in one or more DNA 
repair enzymes and/or associated proteins in the cell 
containing the mutant gene. Therefore, where the disease is 
associated with a deficiency in a DNA repair enzyme, the 
disease may be treated by administering a DNA repair enzyme, 

30 or sequences encoding the repair enzyme, to the patient. 

The present invention thus further provides a method 
of treatment and/or prevention of a disease caused by or 
associated with at least one gene having one or more somatic 
35 mutations giving rise to a frameshift mutation, comprising 
administering a vector encoding one or more repair enzymes 
under the control of a promoter to a patient suffering fr m 
or likely to suffer from the disease, or delivering the 
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repair enzyme itself to a target c 11 containing the 
frameshifted gene. 

The repair enzyme can be any repair enzyme which is 
5 capable of or contributes to, the repair of a somatic 
mutation which leads to a frameshift mutation, such as a 
dinucleotide deletion. Preferred repair enzymes include Mut 
H, Nut S, Hut L, and Mut U and the mammalian homologues 
thereof. These include MSH 1-6, PMS 1-2 and MLH 1, as 

10 described in Prolla et al., Science 265:1091, 1994; Strand 
et al., 1993, Nature 365:274; Kramer et al., 1984, Cell 
38:879; Johnson et al., 1995, Science 269:238; Horii et al., 
1994, Biochem. Biophys. Res. Comm. 204:1257; Jeyaprakash et 
al., 1994, Mutation Research 325:21; J.H.J. Hoei jmakers, 

15 1987, Journal of Cell Science Suppl. 6:111-125; and Kunkel 
et al. , supra. 

The invention also encompasses methods of combatting 
diseases caused by at least one gene having one or more 

20 somatic mutations giving rise to a frameshift mutation by 
targeting the mRNA transcript transcribed from the mutated 
gene. Thus, it is also contemplated according to the 
invention that a frameshift mutation within a gene may be 
corrected at the level of the frameshifted RNA via splicing 

25 using a ribozyme having specificity for the correct mRNA 
sequence (see Denman et al., Arch. Of Biochem. Biophysics. 
323,71-78,1995), and correcting the mutant mRNA using 
ribozymes. The disease associated with the frameshifted 
gene is thus treated by administering an appropriate 

30 ribozyme, or sequences encoding the ribozyme, to the 
patient. 

Ribozymes of selected specificities may be made as 
described by Sullenger & Cech, Nature 371: 619-622, 1994), 
35 herein incorporated by reference. The ribozyme encoded by 
the vector is preferably sp cific for RNA containing the 
mutations described above or for the RNA encoding a 
defective repair enzyme. For example, if a defect in a 
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repair enzyme has 1 d to the nutated gene, by correcting the 
transcribed messag from the repair enzyme gene the functi n 
of the repair enzyme can be restored. 

5 Ribozymes and sequences encoding ribozymes may be 

prepared as described by Tuschl et al., Curr. Opin. Struc. 
Biol. 5:296, 1995 and Wahl et al., Curr. Opin. Struc. Biol. 
5:282, 1995. 

10 The invention also encompasses methods of treating 

diseases caused by or associated with at least one gene 
having one or more somatic mutations giving rise to a 
frameshift mutation by delivery of oligonucleotides or 
sequences encoding oligonucleotides to a target cell 

15 containing a frameshifted gene. The oligonucleotides will 
have a wild- type sequence with respect to the corresponding 
mutant sequence of the frameshifted gene, and thus may serve 
as templates for correction of the mutant sequence. For 
example, triple helix-forming oligonucleotides may be used 

20 to target mutations to selected genes within mammalian 
cells. Targeted mutagenesis in vivo depends upon the 
strength and specificity of the third strand binding. 
Oligonucleotides with strong target site binding affinity, 
i.e., with full target site homology (except for the base to 

25 be corrected, which base will be mismatched between the 
oligonucleotide and the nuclear DNA) are preferred. Also 
preferred are oligonucleotides between 10 - 30 nucleotides 
in length and containing a GAGA, GAGAC, GAGAG, GAGAA or 
GAGAT motif. Formation of a site-specific triple strand 

30 brings the wild type oligonucleotide sequence into proximity 
with the base pair to be mutated. DNA repair enzyme will 
then recognize the mismatch and will repair the mismatch. 
Targeted mutagenesis via triple helix formation has been 
shown to work in mammalian cells (see Wang et al., 1995, 

35 Mol. Cell Biol. 15:1759). 

Disease treatm nt according to the invention is 
described below and includes preparation of the administered 
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substance and administration of the substance to a patient 
suffering from a disease according to the inventi n. As 
used herein M substance" refers to any one of the following: 
a DNA repair enzyme , a ribozyme, a nucleic acid sequence 
5 encoding a repair enzyme or a ribozyme, a nucleic acid 
sequence comprising a wild type version of a mutant gene, 
and a wild type protein encoded by the wild type gene, an 
antibody specific for the frameshifted (nonsense) protein, 
and oligonucleotides having a wild type sequence to serve as 
10 a template for repair of a frameshift mutation* 

Disease treatment according to the invention may be 
accomplished as follows. In Example 5, administration of 
nucleic acid sequences according to the invention is 
15 described. 

In Example 6, the purification of DNA repair enzymes 
is described. In Example 7, administration of proteins, 
ribozymes, or nucleic acids using liposomes is described* 
20 And in Example 8, delivery of these substances across the 
blood-brain barrier is described. 

Exapple 5 

25 Administration of Nucleic Acids 

Delivery of nucleic acid sequences encoding substances 
which effect or facilitate repair of the frameshifted gene 
is carried out as follows. Sequences encoding repair 

30 enzymes, ribozymes, a wild* type version of a mutant gene, or 
a wild-type oligonucleotide sequence may be cloned into an 
appropriate vector for expression in a desired mammalian 
cell. The vector will include a promoter that is expressed 
in the target cell type, and also may include an enhancer 

3 5 and locus control region, as selected for expression in a 
given cell type. Examples of vectors useful according to 
the invention include but are not limited to any vector 
which results in successful transfer of the coding sequences 
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to the target mammalian cell, including both viral r non- 
viral vectors, .g., r troviral vectors or adenoviral 
vectors. 

5 For example, the retroviral gene transfer vector SAX 

(Kantoff et al., Proc. Nat. Aca. Sci. 83:6563, 1986) may be 
used to insert a selected coding sequence into a target 
cell. SAX is a moloney virus based vector with the neoR 
gene promoted from the retroviral LTR and the human ADA gene 

10 promoted from an internal SV40 promoter. Thus, the SAX 
vector may be engineered by one of skill in the art to 
contain the coding sequence for a DNA repair enzyme, a 
ribozyme, or a wild-type version of a mutant gene, or a 
selected oligonucleotide template sequence, identified as 

15 described herein, e.g., by substituting the desired coding 
region for the hADA coding region in the SAX vector. 

It has been previously shown that ribozymes can be 
targeted to and can correct specific errors via a 

20 transplicing mechanism. See Sullenger & Cech, Nature, 371 . 
pp 619-622, 1994. By correcting the mutant mRNA, the 
correct protein will be translated thus preventing and/or 
treating the disease. One skilled in the art following the 
teaching of Sullenger & Cech would easily be able to design 

25 a ribozyme to correct the mRNA transcript transcribed from 
the mutant gene of the present invention. 

Expression vectors are known in the art which encode, 
or may be engineered to encode, a selected ribozyme. Yuyama 

30 et al. , Nucl. Acids Res. 22:5060, 1994, describe a 
multifunctional expression vector encoding several 
ribozymes. This vector may be adapted to encoded a ribozyme 
of a selected specificity by substituting one or both 
ribozyme sequences in the vector for a selected ribozyme 

35 sequence. Zhou et al., Gene 149:33, 1994, and Yamada et 
al., Virology 205;121, 1994, describe retroviral 
transduction of ribozyme sequences into T cells. These 
retroviral vectors may be adapted to encode a selected 
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ribozyme sequenc . Liu et al., Gene Therapy 1:32, 1994, and 
Le t al., Gene Therapy 2:377, 1995, describe expressi n 
vectors which are adaptable for use in expression of any 
nucleic acid sequence contemplated according to the 
5 invention. 

Once the vector contains the desired coding region, 
the vector may be introduced ex vivo into a selected 
population of cells isolated from a patient, and the 

10 transfected cells then reintroduced into a patient. The 
coding sequence for the repair enzyme, ribozyme, wild-type 
version of a mutant protein, or oligonucleotide, will then 
be expressed in the patient, and the reintroduced population 
of cells may expand and thus provide a cell population in 

15 which the frameshift mutation is corrected. 

Alternatively, the vector may be encapsulated in 
liposomes, as described hereinbelow, and administered to the 
patient. The vector-containing liposomes may be prepared so 

20 as to target a particular cell type suspected or known to 
contain a frameshift mutation in a particular gene, and the 
vector will be introduced into that cell type and result in 
correction of the frameshift mutation, i.e., by virtue of 
the presence of the encoded DNA repair enzyme or by 

25 substitution of the mutant gene sequence with the 
corresponding wild-type sequence. Should the frameshift 
mutation result from a splicing defect, the presence of an 
appropriate ribozyme is expected to result in correction of 
the splicing defect, and thus of the frameshift mutation. 

30 

Example 6 

Preparation of DNA Repair Enzymes 

35 

DNA repair enzymes may be prepared and purified for 
use in the invention as follows. A general review of 
purification methods for DNA r pair enzymes can be found in 
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DNA REPAIR: A Laborat ry Manual of Research Procedures, 

edited by E. Priedberg and P.C- Hanawalt, published by 
Marcel Dekker, New York. 

5 With regard to purification of repair enzymes useful 

according to the invention, the first step of the 
purification process, molecular sieving, serves to separate 
the DNA repair enzymes from the vast majority of proteins 
with larger sizes based on relative rates of migration of 
10 the DNA repair enzymes and the contaminating proteins 
through the molecular sieve matrix. 

Molecular sieving can be accomplished by many methods, 
including gel filtration and electrophoresis. In gel 

15 filtration proteins flow around and through pores in beads 
made from dextran, polyacrylamide, agarose, agarose and 
acrylamide composites, or other material. The size of the 
bead pores include or exclude proteins based on size. In 
electrophoresis, proteins move in an applied electric field 

20 through a sizing matrix. 

The preferred molecular sieving method for use with 
the present invention is gel filtration because the enzyme 
can be easily recovered and because the method is 

25 independent of such factors as net protein charge. The pore 
size of the beads used with this method are selected to 
maximize separation of DNA repair enzymes from the bulk of 
other proteins. A general guideline for selecting the gel 
filtration matrix is that the gel should have an exclusion 

30 limit greater than about twice the molecular weight or 
Stokes 1 radius of the DNA repair enzyme and less than about 
60,000 daltons or 35 Angstroms. 

A wide variety of elution buffers may be used to elute 
35 the DNA repair enzyme from the gel filtration column. The 
selected buffer should satisfy the following criteria: 1) 
the buffer should not denature or inactivate the DNA r pair 
enzyme, 2) the buffer should not permit ionic adsorption of 
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the DNA repair enzyme to the gel filtration media, and 3) 
the buffer should be compatible with loading f the eluate 
onto the nucleic acid affinity column, that is, the elution 
buffer should be chosen so that complexes will form between 
5 the DNA repair enzyme and the immobilized nucleic acids of 
the affinity column. 

The second step of the purification process-nucleic 
acid binding-separates the DNA repair enzymes from the 

10 remaining protein impurities by the ability of DNA repair 
enzymes to reversibly bind to nucleic acids. Separation by 
nucleic acid binding can be accomplished by various methods, 
including nucleic acid affinity chromatography. In this 
method, nucleic acids are immobilized on an inter matrix, 

15 such as agarose, polyacrylamide beads, cellulose or other 
media. Depending on the DNA repair enzyme which is being 
purified, the immobilized nucleic acids may be double- or 
single-stranded DNA, double- or single-stranded RNA, or 
other types, lengths, structures or combination of nucleic 

20 acids, such as tRNA, 2 -DNA, supercoiled DNA, ultraviolet- 
irradiated DNA or DNA modified by other agents. Single- 
stranded DNA is preferred. 

The nucleic acids may be attached to the solid phase 
25 matrix by a variety of methods, including covalent 
attachment of the nucleic acid through primary amines or by 
absorbing the nucleic acids to a matrix such as cellulose, 
which releases nucleic acids slowly. The preferred 
immobilization method is use in a cyanogen-bromide 
30 activated Sepharose and to bind the nucleic acids to the 
activated Sepharose covalent ly. Alternatively, single- 
stranded DNA covalently bound to agarose can be purchased 
commercially from Bethesda Research Labs. Gaithersburg, MD. 
(Catalog No. 5906SA) . 

35 

The DNA repair enzymes are applied to the nucleic 
acids in a solution which should satisfy the following 
criteria: 1) th solution should permit reversible binding 
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of the DNA repair enzyme to the nucleic acids, 2) the 
soluti n should reduce nonspecific binding of contaminating 
proteins to the nucleic acids, and 3) the solution should 
not cause damage to the nucleic acids. in general, a 
5 neutral buffered solution with physiological saline and 1 mM 
EDTA will satisfy these criteria. As discussed above, in 
accordance with the invention, the nucleic acid affinity 
column. Accordingly, the elution buffer used with the 
molecular sieve column should be chosen to satisfy the 
10 foregoing criteria. 



15 



The bound DNA repair enzymes are eluted from the 
nucleic acid affinity column with a gradient which removes 
the enzyme from the nucleic acid at a characteristic 
condition and concentrates the enzyme by the focusing effect 
of the gradient. The elution system, however, should not 
denature the enzyme or introduce contaminants into the final 
product. A gradient of NaCl up to l.OM will in general be 
sufficient to reverse binding of most DNA repair enzymes to 
20 nucleic acids. in appropriate cases, the gradient may be 
one of another salt, increasing or decreasing pH, 
temperature, voltage or detergent, or, if desired, a 
competing ligand may be introduced to replace the nucleic 
acid binding. 



25 



Example 7 



30 



35 



Liposomal Delivery Accordin g to the Invention 

Substances may be administered according to the 
invention using any delivery means known in the art. 
Described below is liposomal delivery. Liposomes which are 
used to administer the DNA repair enzymes, ribozymes, a 
wild-type version of a hybrid wild-type/nonsense protein, or 
nucleic acid sequences encoding enzymes, ribozymes or a 
wild-type protein, antibodies specific for a hybrid wild- 
type/nonsense protein or oligonucleotides can be of various 
types and can have various compositions. The primary 
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restrictions are that the liposomes should not be toxic to 
the living cells and that they should deliv r their cont nts 
into the interior of the cells being treated. 

5 The use of pH sensitive liposomes to mediate the 

cytoplasmic delivery of calcein and FITC dextran has been 
described (see Straubinger et al., Cell 32:1069-1079, 1983; 
and Straubinger et al. , FEBS Letters 179:148-154, 1985. 
Other discussions of pH sensitive liposomes can be found in 

10 chapter 11 of the book CELL FUSION, edited by A.E. Sowers, 
entitled "Fusion of Phospholipid Vesicles Induced by 
Divalent Cations and Protons" by Nejat Duzgunes et al., 
Plenum Press, N.Y., 1987, 241-267. See also Ellens et al., 
Biochemistry, 23:1532-1538, 1984, and Bentz et al., 

15 Biochemistry 26:2105-2116, 1987. 

The liposomes may be of various sizes and may have 
either one or several membrane layers separating the 
internal and external compartments. The most important 

20 elements in liposome structure are that a sufficient amount 
of enzyme or nucleic acid be sequestered so that only one or 
a few liposomes are required to enter each cell for delivery 
of the substance, and that the liposome be resistant to 
disruption. Liposome structures include small unilamellar 

25 vesicles (SUVs, less than 250 angstroms in diameter), large 
unilamellar vesicles (LUVs, greater than 500 angstroms in 
diameter) , and multilamellar vesicles (HLs) . In the example 
presented below, although SUVs are used to administer DNA 
repair enzymes, the methods are applicable to administration 

30 of ribozymes or sequences encoding repair enzymes, 
ribozymes, antibodies specific for hybrid wild- 
type/nonsense proteins, or wild-type genes or their encoded 
proteins, or oligonucleotides. 

35 SUVs can be isolated from other liposomes and 

unincorporated enzyme by molecular weight can be isolated 
from other liposomes and unincorporated enzyme by molecular 
sieve chromatography, which is precise but time consuming 
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and dilutes the liposomes,, or differential centrifugati n, 
which is rapid but produces a wider range of liposome sizes. 

The liposomes may be made from natural and synthetic 
5 phospholipids, glycolipids, and other lipids and lipid 
congeners; cholesterol, cholesterol derivatives and other 
cholesterol congeners; charged species which impart a net 
charge to the membrane; reactive species which can react 
after liposome formation to link additional molecules to the 
10 liposome membrane; and other lipid soluble compounds which 
have chemical or biological activity • 

The liposomes useful according to the invention may 
be prepared, for example, as described in U.S. Patent No. 

15 5,296,231, which describes preparation of liposomes 
containing a DNA repair enzyme, although it should be borne 
in mind that liposomes useful according to the invention may 
contain any one of the substances as herein described. 
Briefly, by combining a phospholipid component with an 

20 aqueous component containing the DNA repair enzyme (or 
desired substance) under conditions which will result in 
vesicle formation. The phospholipid concentration must be 
sufficient to form lamellar structures, and the aqueous 
component must be compatible with biological stability of 

25 the enzyme. Methods for combining the phospholipids onto 
glass and then vesicles will form include: drying the 
phospholipids onto glass and then dispersing them in the 
aqueous component; injecting phospholipids dissolved in a 
vaporizing or non-vaporizing organic solvent into the 

30 aqueous component which has previously been heated; and 
dissolving phospholipids in the aqueous phase with 
detergents and then removing the detergent by dialysis. 
The concentration of the DNA repair enzyme in the aqueous 
component can be increased by lyophilizing the enzyme onto 

35 dried phospholipids and then rehydrating the mixture with a 
reduced volume of aqueous buffer. SUV's can be produced 
fr m the foregoing mixtures either by sonication r by 
dispersing the mixture through either small bore tubing or 
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through the small orifice of a French Press* 

DNA repair enzymes incorporated into liposomes can be 
administered to living cells internally or topically. 
5 Internal administration to animals or humans requires that 
the liposomes be pyrogen-f ree and sterile. To eliminate 
pyrogens, pyrogen-f ree raw materials, including all 
chemicals, enzymes, and water, are used to form the 
liposomes. Sterilization can be performed by filtration of 

10 the liposomes through 0.2 micron filters. For injection, 
the liposomes are suspended in a sterile, pyrogen-f ree 
buffer at a physiologically effective concentration. 
Topical administration also requires that the liposome 
preparation be pyrogen-f ree, and sterility is desirable. In 

15 this case, a physiologically effective concentration of 
liposomes can be suspended in a buffered polymeric glycol 
gel for even application to the skin. In general, the gel 
should not include non-ionic detergents which can disrupt 
liposome membranes. Other vehicles can also be used to 

20 topically administer the liposomes. 

The concentration of the substance in the final 
preparation can vary over a wide range, a typical 
concentration being on the order of 50 ug/ml. In the case 
25 of pH sensitive liposomes, lower concentrations of the 
substance can be used, e.g., on the order of 0.01 to 1.0 
ug/ml for liposomes administered to cells internally. In 
case of topical application, higher liposome concentrations 
used, e.g., ten or more times higher. 

30 

Example 8 

Adminstration Across the Blood-Brain Barrier 

35 Where it is desired according to the invention to 

administer a DNA repair enzyme, a ribozyme, or their nucl ic 
acid coding sequences, a wild-type version of a hybrid wild- 
type/nonsense protein that is associated with the disease, 
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or its coding sequence , or oligonucleotides or th ir coding 
sequences, r liposomes containing such substances, to an 
individual such that the administered material crosses the 
blood-brain barrier, several methods are known in the art* 

5 

For example, a substance to be administered, whether 
it be protein or nucleic acid or liposome, may be co- 
administered with a polypeptide, for example a lipophilic 
polypeptide that increases permeability at the blood-brain 

10 barrier* Examples of such polypeptides include but are not 
limited to bradykinin and receptor mediated permeabilizers, 
such as A- 7 or its conformational analogues, as described in 
U.S. Patent Nos. 5,112,596 and 5,268,164* The 
permeabilizing polypeptide allows the co-administered repair 

15 enzyme, ribozyme, coding sequence or liposome to penetrate 
the blood-brain barrier and arrive in the cerebrospinal 
fluid compartment of the brain, where the repair enzyme, 
ribozyme, or coding sequence may then reach and enter a 
target neuronal cell. Alternatively, the substance to be 

20 administered may be coupled to a steroidal estrogel or 
androgel to increase binding to steroid receptors and thus 
access to the brain. 

Another exemplary method for administering a substance 
25 such as a DNA repair enzyme, ribozyme, antibody, nucleic 
acids, or liposomes containing such molecules, according to 
the invention includes forming a complex between the 
substance to be administered and an antibody that is 
reactive with a transferrin receptor, as described in U.S. 
30 Patent No. 5,182,107. The complex may include a cleavable 
or non-cleavable linker and is administered under conditions 
whereby binding of the antibody to a transferrin receptor 
on a brain capillary endothelial cell occurs and the 
substance is transferred across the blood-brain barrier in 
35 active form. 



SUBSTITUTE SHEET (RULE 26) 



PCT/IB96/01106 

WO 97/12992 63 

Other Dosaaes and Modes of Admin istrat ion 

A patient that is subject to a disease state which is 
associated with a frameshift mutation may be treated in 
5 accordance with the invention, as described above, via In 
vivo , ex vivo or in vitro methods. For example in in YiVP 
treatments, a nucleic acid vector encoding a repair enzyme, 
ribozyme, wild-type version of a hybrid wild-type/nonsense 
protein, or oligonucleotide that corresponds in sequence 

10 (except for the inserted or deleted nucleotide (s) ) to the 
f rameshif ted region of the hybrid wild-type/nonsense protein 
coding region can be administered to the patient, preferably 
in a pharmaceutical^ acceptable delivery vehicle and a 
biologically compatible solution, by ingestion, injection, 

15 inhalation or any number of other methods. The dosages 
administered will vary from patient to patient; an 
"effective dose" will be determined by the level of 
enhancement of function of the transferred genetic material 
balanced against any risk of deleterious side effects. 

20 Monitoring gene expression and/or the presence or levels of 
the encoded mutant protein or its corresponding "sense" 
protein will assist in selecting and adjusting the dosages 
administered. Generally, a composition including a vector 
will be administered in a single dose in the range of 10 ng 

25 - 100 ug/kg body weight, preferably in the range of 100 ng - 
10 ug/kg body weight, such that at least one copy of the 
sequence is delivered to each target cell. A composition 
including a protein, e.g., a DNA repair enzyme or a wild- 
type version of a hybrid wild-type/nonsense protein, will be 

30 administered in single or multiple doses, as determined by 
the physician, in the range of 10 ug - 1 mg, or within the 
range of 100 ug - 50 ug. A composition including a 
nucleoprotein such as a ribozyme will be administered in 
single or multiple doses, as determined by the physician, in 

35 the range of 50 ug - 1 mg, or within the range of 100 ug - 
500 ug. A composition including an oligonucleotide will be 
administered in a single dose in the range of 5 ng - 10 ug, 
or within the range of 100 ng - 500 ng. Any of the above 
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dosages may be administered according to the body weight of 
the pati nt, as determined by the physician* 

Ex vivo transduction is also contemplated within the 
5 present invention. Cell populations can be removed from the 
patient or otherwise provided, transduced with a vector in 
accordance with the invention, then reintroduced into the 
patient. The number of cells reintroduced into the patient 
will depend upon the efficiency of vector transfer, and will 
10 generally be in the range of 10* - 10 6 transduced 
cells/patient. 

The cells targeted for ex vivo gene transfer in 
15 accordance with the invention include any cells to which the 
delivery of the vector is desired, for example, neuronal 
cells or stem cells. 

Protein, nucleic acid, or cells administered according 

20 to the invention is preferably administered in admixture 
with a pharmaceutical ly acceptable carrier substance, e.g., 
magnesium carbonate, lactose, or a phospholipid to form a 
micelle, the carrier and protein, nucleic acid or cell 
together can form a therapeutic composition, e.g., a pill, 

25 tablet, capsule or liquid for oral administration to the 
mammal. Other forms of compositions are also envisioned, 
e.g., a liquid capable of being administered nasally as 
drops or spray, or a liquid capable of intravenous, 
parenteral, subcutaneous, or intraperitoneal administration. 

30 The substance administered may be in the form of a 
biodegradable sustained release formulation for 
intramuscular administration* For maximum efficacy, where 
zero order release is desirable, e.g., an implantable or 
external pump, e.g., an Infusaid™ pump (Infusaid Corp, MA), 

35 may be used. 
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Kits Useful Acc rding t th Invention 

The invention encompasses kits for diagnosis or 
treatment of diseases according to the invention, a kit 
5 including suitable packaging materials and one or more of 
the following reagents: a nucleic acid probe is as defined 
hereinabove, and optionally means for detecting the probe 
when bound to its complementary sequences. For example, the 
nucleic acid probe may be labeled, e.g., radiolabeled, 
10 f luorescently labeled, etc., or may be detected via indirect 
labeling techniques, e.g., using a biotin/avidin system, 
well known in the art. 

A diagnostic system, preferably in kit form, -comprises 
15 yet another embodiment of this invention. This system is 
useful for assaying the presence of a hybrid wild- 
type/nonsense protein or its derivative in cells by the 
formation of an immune complex. This system includes at 
least one package that contains an antibody of this 
20 invention. Optionally, a kit also may include a positive 
tissue sample control. 

Antibodies are also utilized along with an "indicating 
group" also sometimes referred to as a "label 91 . The 
25 indicating group or label is utilized in conjunction with 
the antibody as a means for determining whether an immune 
reaction has taken place, and in some instances for 
determining the extent of such a reaction. 

30 The terms "indicating group" or "label" are used 

herein to include single atoms and molecules that are linked 
to the antibody or used separately, and whether those atoms 
or molecules are used alone or in conjunction with 
additional reagents. Such indicating groups or labels are 

3 5 themselves well-known in immunochemistry and constitute a 
part of this invention only insofar as they are utilized 
with otherwise novel antibodies, methods and/or systems. 
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For example, an antigen-specific antibody or antibody 
fragment is detectably labeled by linking the same to an 
enzyme and use it in an EI A, or enzyme- linked immunosorbent 
assay (ELISA) . This enzyme, in turn, when later exposed to 
5 a substrate in such a manner as to produce a chemical moiety 
which can be detected, for example, by spectrophotometry, 
f lour ome trie or, most preferably, by visual means* The 
substrate may be a chromogenic substrate which generates a 
reaction product visible to the naked eye. 

10 

Enzymes which can be used to detectably label the 
binding protein which is specific for the desired detectable 
mutant protein, include, but are not limited to, alkaline 
phosphatase , horseradish peroxidase , glucose-6-phosphate 

15 dehydrogenase, staphylococcal nuclease, delta -v-steroid 
isomer ase, yeast alcohol dehydrogenase, alpha- 
glycerophosphate dehydrogenase, triose phosphate isomerase, 
asparaginase, ribonuclease, urease, catalase, glucose-6- 
phosphate dehydrogenase, g lucoamy lase and 

20 acetylcholinesterase . 

By radioactively labeling the binding protein, for 
example, the antibody, it is possible to detect the antigen 
bound to a solid support through the use of a 
25 radioimmunoassay (RIA) . The radioactive isotope can be 
detected by such means as the use of a gamma counter or a 
scintillation counter or by autoradiography. Isotopes which 
are particularly useful for the purpose of the present 
invention are: 3 H, 131 1, U C, and preferably 125 I. 

30 

It is also possible to label the first or second 
binding protein with a fluorescent compound. When the 
f luorescently labeled antibody is exposed to light of the 
proper wave length, its presence can then be detected due to 
35 fluorescence. Among the most commonly used fluorescent 
labelling compounds are fluorescein isothiocyanate, 
rhodamine, phycoerythrin, phycocyanin, allophycocyanin, o- 
phthaldehyde and f luorescamine. 
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The first r s c nd binding protein also can be 
detectably labeled by coupling it to a chemi luminescent 
compound* The presence of the chemi luminescent -tagged 
5 antibody is then determined by detecting the presence of 
luminescence that arises during the course of a chemical 
reaction. Examples of particularly useful chemi luminescent 
labeling compounds are luminol, isoluminol, theromatic 
acridinium ester, imidazole, acridinium salt and oxalate 
10 ester* 

Likewise, a bioluminescent compound may be used to 
label the first or second binding protein. Bioluminescence 
is a type of chemi luminescence found in biological systems 
15 in which a catalytic protein increases the efficiency of the 
chemi luminescent reaction. The presence of a bioluminescent 
protein is determined by detecting the presence of 
luminescence. Important bioluminescent compounds for 
purposes of labeling are luciferin, lucif erase and aeguorin. 

20 

The invention also includes diagnostic reagents for 
use in the present invention, such as nucleic acid 
sequences, probes and antibody molecules, and/or positive 
tissue controls, as described above, and kits including such 
25 reagents for use in diagnosing or treating a disease. 

An indicating group or label is preferably supplied 
along with the antibody and may be packaged therewith or 
packaged separately. Additional reagents such as hydrogen 

30 peroxide and diaminobenzideine, and nickel ammonium sulfate 
may also be included in the system when an indicating group 
such as HRP is utilized. Such materials are readily 
available in commerce, as are many indicating groups, and 
need not be supplied along with the diagnostic system. In 

35 addition, some reagents such as hydrogen peroxide decompose 
on standing, or are otherwise short-lived like some 
radioactive elements, and are better supplied by the end- 
user. 
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Other Embodiments 

It will be understood that the invention is described 
by way of illustration only. Many other embodiments of the 
5 present invention In addition to those herein described will 
be apparent to those skilled in the art from the description 
herein given without departing from the scope of the present 
invention as defined in the appended claims* 
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Table 1 

Table 1. Estimated percentages of early (<60 years) and 
late (>60 years) Alzheimer' s disease (AD) onset. 
5 60% non-familial/ 40% familial (Van Broeckhoven, 1995, Eur. 
Neurol. 35, 8-19) 

/ 28% chromosome 14 (Presenilin I, S182) 

10 f 2.4% chromosome 21 (BAPP) 

25% 

\ 1.0% chromosome 1 (Presenilin II , STM2) 



15 



\ 60% sporadic cases 
\ 8.6% sporadic cases 
AD 10-30% chromosome 19 (apolipoprotein E) 

20 75% late onset 

\ 

70% sporadic cases 

According to Ott et al. (Br. Med. J. 310 , 970-973, 1995) the 
25 prevalence of dementia increases exponentially with age. In 
a case finding study in a general population of subjects of 
65 years of age and older, 34% of the subjects aged 85 and 
older had dementia. Of all cases of dementia, 72% were 
cases of Alzheimer' s disease. 
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Table 2 

Table 2. Immunoreactivities within the frontal lobe of the 
human frontal cortex for various neuronal proteins of which 
5 the mKNA is expressed in the +1 reading frame. Tissues were 
obtained from controls and neuropatho logically confirmed 
Alzheimer cases. 
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TABLE 5 



GAGAG MOTIFS 



BASE PAIRS 
(CODING 
SEQUENCE OF 
LONGEST FORM) 



BAPP 
TAU 

UBIQUITIN 
APOLIPOPROTEIN E4 
MAP2 

NF-LOW (68K) 
NF- MEDIUM (145K) 2748 
NF HIGH (200K) 3063 



2234 
1096 
687 
953 
5475 
582 



EXPECTED 
NUMBER 
(1:1024) 

2.2 
1.1 
0.7 
0.9 
5.3 
0.6 
2.7 
3.1 



ACTUAL 
NUMBER 



11 
3 
3 
2 
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RGRTSSKELA 

HGRLAPARHAS 

YADLREDPDRQ 

RQDHHPGSGAQ 

GAPRLPPAQAA 

KTRFQRKGPS 

PGNRSMGHE 

EAEGGSRS 

VGAARDSRAA 



Table 7 
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CLAIMS 

1. A method for the diagnosis of a disease caused by or 
associated with a gene having a somatic mutation giving rise 
to a frameshift mutation comprising: 

i. providing a biological sample from a patient 
suspected of having or developing said disease; and 

ii. detecting in said sample the presence of a mutant 
gene having a frameshift mutation or a protein encoded 
thereby , 

wherein detection is indicative of the disease. 

2. The method of claim l, wherein the frameshift mutation 
comprises a deletion or an insertion of a nucleotide* 

3. The method of claim 2, wherein the frameshift mutation 
is associated with the nucleotide sequence GAGA. 

4 . The method of claim 3 , wherein the frameshift mutation 
comprises a dinucleotide mutation associated with a 
nucleotide sequence comprising GAGA* 

5. The method of claim 3, wherein said sequence comprises 
GAG AX, where X is one of G, A, T, or C. 

6* The method of claim 3, wherein said sequence comprises 
GAGAG or GAGAC. 

7. The method of claim 1, wherein the disease is cancer 
or a neurodegenerative disease. 

8. The method of claim 7, wherein the disease is 
Parkinson's disease, Alzheimer's disease, or Down's 
Syndrome . 

9. The method of claim l wherein the gene having a 
frameshift mutation encodes the ft amyloid precursor protein, 
the Tau protein, ubiquitin, apolipoprotein-E 4 (Apo-E A ) , 
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microtubule associated protein II (HAP 2) or the 
neurofilament pr teins. 

10. The method of claim 1 wherein the biological sample 
comprises body fluid or tissue. 



11. The method of claim 10 wherein said body fluid 
comprises cerebral spinal fluid or blood. 

12. The method of claim 10, wherein the tissue comprises 
skin or nose epithelium. 



13. The method of claim 1, wherein the mutant gene is 
detected by formation of a nucleic acid duplex wherein a 
first strand of said duplex comprises a nucleic acid probe 
having a sequence complementary to part of the mutant gene 
encompassing the mutation giving rise to the frameshift 
mutation, and the second strand of said duplex comprises a 
nucleic acid sequence of the mutant gene which is 
complementary to said probe. 

14. The method of claim 1, wherein the mutant gene is 
detected using PCR to amplify a fragment of the mutant gene 
encompassing the mutation giving rise to the frameshift, and 
then probing for the amplified fragment using a nucleic acid 
probe having a sequence complementary to part of the mutant 
gene encompassing the mutation giving rise to the frameshift 
mutation, or by sequencing the amplified fragment. 

15. The method of claim 1, wherein the protein encoded by 
the mutant gene is detected using an antibody molecule 
having specificity for the mutant protein and not for the 
wild-type protein. 

16. A method for identifying diseases caused by or 
associated with a gene having a somatic mutation giving rise 
to a frameshift mutation comprising: 

i- providing the sequence of a gene suspected of 
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being involved in the pathogenesis of a diseas ; 

ii. identifying the sequence of th mutant protein 
encoded by the gene sequence following a frameshift 
nutation; 

iii. preparing a probe to the mutant protein or a 
fragment thereof; and 

iv. probing a biological sample from a patient having 
the disease and a biological sample from a patient not 
having the disease, 

wherein the presence of said mutant protein in a biological 
sample from a patient having the disease and the absence of 
said mutant protein in a biological sample from a patient 
not having the disease indicates that the presence of the 
mutant protein in a biological sample is a marker for the 
disease or susceptibility to the disease. 

17. A diagnostic kit for diagnosing a disease caused by 
or associated with a gene having a somatic mutation giving 
rise to a frameshift mutation, the kit comprising: 

i. a labelled nucleic acid probe having a sequence 
complementary to part of the mutant gene which 
encompasses the mutation which leads to the frameshift 
mutation; and 

ii. packaging materials therefor. 

18. A diagnostic kit for diagnosing a disease caused by 
or associated with a gene having a somatic mutation giving 
rise to a frameshift mutation comprising: 

i. a pair of primers for use in a PCR reaction, 
wherein said pair comprises sequences complementary 
to sequences on either side of the mutation which 
gives rise to the frameshift mutation, and reagents 
necessary for performing a PCR reaction; and 

ii. packaging materials therefor. 

19. A diagnostic kit for diagnosing a disease caused by 
or associated with at least one gene having one or more 
somatic mutations giving rise to a frameshift mutation 
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comprising: 

i. an antibody molecule having specificity for the 
mutant protein and not the wild-type protein; and 

ii. packaging materials therefor. 

20. A recombinant gene having a frameshift mutation, as 
described in of any one of claims 1 to 9. 

21. The gene of claim 20 encoding at least part of the 
protein sequence designated +1 or +2 shown in any one of 
Figures 2 to 9. 

22. A mutant protein encoded by the gene of claim 20 or 
21. 

23. An immunogenic fragment of the mutant protein of claim 
22. 

24. The mutant protein of claim 22 or the immunogenic 
fragment of claim 23, comprising the amino acid sequence: 

RGRTSSKELA ; 

HGRLAPARHAS; 

YADLREDPDRQ; 

RQDHHPGSGAQ ; 

GAPRLPPAQAA; 

KTRFQRKGPS; 

PGNRSMGHE; 

EAEGGSRS ; or 

VGAARDSRAA. 



25. A pharmaceutical composition comprising a DNA repair 
enzyme admixed with a pharmaceutical ly acceptable carrier. 

26. A method of treatment and/or prevention of a disease 
caused by or associated with a gene having a somatic 
mutation giving rise to a frameshift mutation, comprising 
administering a vector comprising an expressible gene 
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encoding a repair nzyme to a patient suffering from or 
susceptible to the disease. 

27. A vector comprising an expressible gene encoding a 
repair enzyme. 

28. A method of treatment and/or prevention of a disease 
caused by or associated with a gene having a somatic 
mutation giving rise to a frameshift mutation, comprising 
administering a vector comprising an expressible gene 
encoding a ribozyme to a patient suffering from or 
susceptible to the disease. 

29. A vector comprising an expressible gene encoding a 
ribozyme. 

30. A pharmaceutical composition comprising the wild- 
type analog of a mutant protein in admixture with a 
pharmaceutically acceptable carrier. 

31. A method of treatment and/or prevention of a disease 
caused by or associated with a gene having a somatic 
mutation giving rise to a frameshift mutation, comprising 
administering a vector comprising an expressible gene 
encoding the wild-type version of the mutated gene to a 
patient suffering from or susceptible to the disease. 

32. A vector comprising an expressible gene encoding the 
wild-type version of a mutated gene. 

33. A host cell containing a recombinant gene comprising 
a frameshift mutation, as described in claim 20 or 21. 

34. A transgenic animal having somatic cells which contain 
a transgene comprising a frameshift mutation, as described 
in claim 20 or 21. 
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35. The use of a vector encoding one or more repair 
enzymes under the control of a promoter in therapy. 

36 . The use of a vector encoding one or more repair 
5 enzymes under the control of a promoter in the manufacture 

of a composition for the treatment of a disease caused by or 
associated with at least one gene having one or more somatic 
mutations giving rise to a frameshift mutation* 

10 37. The use of a vector encoding a ribozyme under the 
control of a promoter in therapy. 

38. The use of a vector encoding a ribozyme under the 
control of a promoter in the manufacture of a composition 

15 for the treatment of a disease caused by or associated with 
at least one gene having one or more somatic mutations 
giving rise to a frameshift mutation. 

39. The use of a vector encoding the wildtype version of 
20 the mutated gene under the control of a promoter in therapy. 

40. The use of a vector encoding the wildtype version of 
the mutated gene under the control of a promoter in the 
manufacture of a composition for treating a disease caused 

25 by or associated with at least one gene having one or more 
somatic mutations giving rise to a frameshift mutation. 

41. The use of more than one of the vectors of claim 27, 
29 and 32 in any combination in therapy. 

30 

42. The use of more than one of the vectors of claim 27, 
29 and 32 in any combination in the treatment and/or 
prevention of a disease caused by or associated with at 
least one gene having one or more somatic mutations giving 

35 rise to a frameshift mutation. 
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FIG. 1(A) 




FIG. 1(B) 
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